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PROVISIONAL SPECIFICATION 

Invention Title: 



Genes involved in plant fibre development 
The invention is described in the following statement: 



GENES INVOLVED IN PLANT FIBRE DEVELOPMENT 



FIELD OF THE INVENTION 

The present invention relates to polypeptides, and polynucleotides encoding 
therefor, involved in the regulation of fibre initiation and/or elongation in fibre 
producing plants. In particular, the present invention provides methods of altering fibre 
initiation in cotton and products thereof. The invention also relates to the use of these 
polypeptides and polynucleotides as markers of fibre production in plants including 
cotton. 

BACKGROUND OF THE INVENTION 

Cotton {Gossypium hirsutum, and to a lesser extent Gossypium barbadense 
together with other Gossypium species) provides about 55% of the fibre used in textile 
manufacturing globally and is an important contributor to world economies. The cotton 
fibre is probably the most elongated cell in the plant kingdom. The molecular 
mechanisms that control the differentiation of this elongated plant cell are still largely 
unknown. Although commonly called fibres, these cells are not part of the vascular 
tissue and arise, instead, from the ovule epidermis. Fibres of cotton are extremely long 
single elongated epidermal cells that develop on the outer surface of cotton ovules, 
reaching upwards of 5 centimetres in some species. Fibre initiation starts between a day 
before and up to a day after anthesis and the fibre initials begin to elongate immediately 
after fertilisation, ballooning out from the surface of the seed coat epidermis. After a 
period of elongation, secondary cell wall thickening fills the fibre with cellulose and the 
.fibre dies and collapses to form the mature fibre that is harvested from the seeds. 

In contrast to the discovery of numerous genes responsible for fibre elongation 
and secondary cell wall synthesis, few genes have been identified that are associated 
with fibre initiation. Early cytological studies showed structural changes in fibre initials 
occur up to three days before anthesis: including enlarged nucleoli and nuclei, as well as 
an increased number of Golgi complexes (Berlin, 1986). It has been proposed that the 
Arabidopsis leaf trichomes, which require at least twenty genes for normal development 
(Hulskamp et al., 1994), could serve as a model for elucidating the genetic mechanisms 
controlling cotton fibre initiation and differentiation. 

One of the first genes to be characterised in controlling leaf trichomes, 
GLABROUS 1 (GL1), encodes a member of the Myb family of transcription factors 
(Oppenheimer et al., 1991). An exhaustive search of a cotton ovule cDNA library 
recovered six novel Myb-domain genes, but none of them encoded a GL1 homolog 
(Loguercio et al., 1999). Arabidopsis trichome initiation is proposed to be controlled by 
a trichome promoting complex comprised of GL1, TRANSPARENT TESTA 
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GLABRA1 (TTG1, a WD40 protein), and GLABRA3 (GL3, a basic Helix-Loop-Helix 
protein). GLABRA2 (GL2, a Homeodomain protein) regulates trichome morphology 
and spacing and TRIPTYCHON (TRY, a Myb-like protein) mediates lateral inhibition 
of trichome development in cells adjacent to each trichome (Rerie et al., 1994; Walker 
5 et al., 1999; Szymanski et al., 2000; Schellmann et al., 2002; Ohashi et al, 2002). 
However, genes with similar functions in cotton have yet to be identified, and hence it 
remains speculative whether these two single celled epidermal hair systems share any 
common features. 

There is a need for the identification and characterization of genes involved in 
1 0 fibre initiation in fibre producing plants such as cotton. This will enable markers to be 
used to screen plants for desirable fibre traits, as well as allow for the production of 
transgenic plants with altered fibre production. 

SUMMARY OF THE INVENTION 

15 To identify genes that may be specific to fibre initiation, the present inventors 

have used mRNA from early stage fertilised ovules of wild type and 5 lintless mutants 
of cotton (that produce little if any fibres) to probe a cotton ovule cDNA microarray 
containing 10,000 cDNAs expressed around the time of fibre cell differentiation. Since 
pollination may already have occurred and zygote development initiated at this stage, 

20 the inventors used a separate microarray comparison between the mRNAs of the outer 
integument and those of the inner ovule tissues of the wild type cotton to filter out those 
genes that are not expressed specifically in the seed coat outer integument where the 
fibres are initiated. Using this strategy genes have been identified that are differentially 
expressed in the lintless mutants, and hence play a role in fibre initiation. 

25 As a result, in one aspect the present invention provides a method of altering 

fibre initiation and/or elongation "in a fibre producing plant comprising manipulating 
said plant such that the production of a polypeptide is modified when compared to a 
wild-type plant, wherein the polypeptide comprises a sequence selected from the group 
consisting of: 

30 i) an amino acid sequence provided as any one of SEQ ID NO's:l to 16; or 

ii) an amino acid sequence which is at least 50% identical to any one of SEQ ID 
NO's:l to 16. 

Preferably, the polypeptide comprises an amino acid sequence which is at least 
60%, more preferably at least 65%, more preferably at least 70%, more preferably at 
35 least 75%, more preferably at least 76%, more preferably at least 80%, more preferably 
at least 85%, more preferably at least 90%, more preferably at least 91%, more 
preferably at least 92%, more preferably at least 93%, more preferably at least 94%, 
more preferably at least 95%, more preferably at least 96%, more preferably at least 



4 



10 



30 



35 



97%, more preferably at least 98%, more preferably at least 99%, more preferably at 
least 99.1%, more preferably at least 99.2%, more preferably at least 99.3%, more 
preferably at least 99.4%, more preferably at least 99.5%, more preferably at least 
99.6%, more .preferably at least 99.7%, more preferably at least 99.8%, and even more 
preferably, at least 99.9% identical to any one of SEQ ID NO's: 1 to 16. 

In one embodiment, the polypeptide comprises a sequence selected from the 
group consisting of: 

i) an amino acid sequence provided as SEQ ID NO: 1; or 

ii) an amino acid sequence which is at least 80% identical to SEQ ID NO:l. 
In another embodiment, the polypeptide comprises a sequence selected from the 

group consisting of: 

i) an amino acid sequence provided as SEQ ID NO:2; or 

ii) an amino acid sequence which is at least 80% identical to SEQ ID NO:2. 

In a further embodiment, the polypeptide comprises a sequence selected from the 
15 group consisting of: 

i) an amino acid sequence provided as SEQ ID NO:3; or 

ii) an amino acid sequence which is at least 80% identical to SEQ ID NO:3. 

In another embodiment, the method comprises recombinantly expressing the 
polypeptide in said plant. 

In an alternate embodiment, the method comprises reducing the level of the 
polypeptide endogenously produced by the plant. This can be achieved by any means 
known in the art. One example is by exposing the plant to an antisense polynucleotide 
or a catalytic polynucleotide which hybridizes to an mRNA molecule encoding the 
polypeptide. Another example is by exposing the plant to a dsRNA molecule that 
specifically down^regulates mRNA levels in a cell of an mRNA molecule encoding the 
polypeptide, 

In a further embodiment, the plant is a horticultural plant. 

In a particularly preferred embodiment, the plant is a species of the Genus 
Gossypium. 

In a second aspect, the present invention provides a method of assessing the 
potential of a fibre producing plant to produce fibre, the method comprising analysing 
the plant for a genetic variation in a polynucleotide associated with fibre initiation 
and/or elongation, wherein the polynucleotide comprises a sequence selected from the 
group consisting of: 

i) a nucleotide sequence provided as any one of SEQ ID NO's: 1 7 to 45; or 

ii) a nucleotide sequence which is at least 50% identical to any one of SEO ID 
NO's:17to45. 
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Preferably, the polynucleotide comprises a nucleotide sequence which is at least 
60%, more preferably at least 65%, more preferably at least 70%, more preferably at 
least 75%, more preferably at least 76%, more preferably at least 80%, more preferably 
at least 85%, more preferably at least 90%, more preferably at least 91%, more 
5 preferably at least 92%, more preferably at least 93%, more preferably at least 94%, 
more preferably at least 95%, more preferably at least 96%, more preferably at least 
97%, more preferably at least 98%, more preferably at least 99%, more preferably at 
least 99.1%, more preferably at least 99.2%, more preferably at least 99.3%, more 
preferably at least 99,4%, more preferably at least 99.5%, more preferably at least 

10 99.6%, more preferably at least 99.7%, more preferably at least 99.8%, and even more 
preferably at least 99.9% identical to any one of SEQ ID NO's: 17 to 45. 

As the skilled addressee would be aware, the genetic variation associated with 
fibre initiation and/or elongation may be in the coding portion of a polynucleotide of the 
the invention, or may be genetically linked to be useful as a marker for fibre initiation 

1 5 and/or elongation. 

In one embodiment, the method comprises performing an amplification reaction 
on nucleic acids obtained from said plant, or nucleic acids synthesized using nucleic 
acids from said plant as a template, wherein the production of an amplicon in said 
amplification reaction indicates an association with fibre producing potential. 

20 In another embodiment, the method comprises performing an amplification 

reaction on nucleic acids obtained from said plant, or nucleic acids synthesized using 
nucleic acids from said plant as a template, wherein the lack of production of an 
amplicon in said amplification reaction indicates an association with fibre producing 
potential. 

25 In a further embodiment, the method comprises performing a hybridization 

reaction on nucleic acids obtained from said plant, or nucleic acids synthesized using 
nucleic acids from said plant as a template, wherein a. detectable signal produced by the 
hybridization reaction indicates reduced fibre producing potential. 

In yet another embodiment, the method comprises performing a hybridization 
30 reaction on nucleic acids obtained from said plant, or nucleic acids synthesized using 
nucleic acids from said plant as a template, wherein the lack of a detectable signal by 
the hybridization reaction indicates reduced fibre producing potential. 

In an alternate embodiment, the polynucleotide is mRNA and the method 
comprises determining the levels of mRNA of the polynucleotide in the plant ovule at, 
35 or around, anthesis. 

In a third aspect, the present invention provides a method of assessing the 
potential of a fibre producing plant to produce fibre, the method comprising analysing 
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the plant for polypeptide involved in fibre initiation and/or elongation, wherein the 
polypeptide comprises a sequence selected from the group consisting of: 

i) an amino acid sequence provided as any one of SEQ ID NO's:l to 16; or 

ii) an amino acid sequence which is at least 50% identical to any one of SEQ ID 
5 NO's:ltol6. 

Preferably, the method comprises determining the levels of the polypeptide in the 
plant ovule at, or around, anthesis. 

In another aspect, the present invention provides a substantially purified 
polypeptide selected from the group consisting of: 
1° i) a polypeptide comprising an amino acid sequence as provided in SEQ ID 

NO:l, 

ii) a polypeptide comprising an amino acid sequence which is at least 87% 
identical to SEQ ID NO: 1 , and 

iii) a biologically active fragment of i) or ii), 

1 5 wherein the polypeptide regulates fibre initiation and/or elongation. 

Preferably, the polypeptide comprises an amino acid sequence which is at least 
95% identical to SEQ ID NO:l. 

In a further aspect, the present invention provides a substantially purified 
polypeptide selected from the group consisting of: 
20 i) a polypeptide comprising an amino acid sequence as provided in SEQ ID 

NO:2, and 

ii) a biologically active fragment of i), 
wherein the polypeptide regulates fibre initiation and/or elongation. 

In yet another aspect, the present invention provides a substantially purified 
25 polypeptide selected from the group consisting of: 

i) a polypeptide comprising an amino acid sequence as provided in SEQ ID 

NO:3, 

ii) a polypeptide comprising, an amino acid sequence which is at least 54% 
identical to SEQ ID NO: 3, and 

30 iii) a biologically active fragment of i) or ii), 

wherein the polypeptide regulates fibre initiation and/or elongation. 

In another aspect, the present invention provides a substantially purified 
polypeptide selected from the group consisting of: 

i) a polypeptide comprising an amino acid sequence as provided in SEQ ID 

35 NO:4, 

ii) a polypeptide comprising an amino acid sequence which is at least 55% 
identical to SEQ ID NO:4, and 

iii) a biologically active fragment of i) or ii), 
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wherein the polypeptide regulates fibre initiation and/or elongation. 

In another aspect, the present invention provides substantially purified 
polypeptide selected from the group consisting of: 

i) a polypeptide comprising an amino acid sequence as provided in SEQ ID 

5 NO:5, 

ii) a polypeptide comprising an amino acid sequence which is at least 50% 
identical to SEQ ID NO:5, and 

iii) a biologically active fragment of i) or ii), 

wherein the polypeptide regulates fibre initiation and/or elongation, 
10 In another aspect, the present invention provides a substantially purified 

polypeptide selected from the group consisting of: 

i) a polypeptide comprising an amino acid sequence as provided in SEQ ID 

NO:6, 

ii) a polypeptide comprising an amino acid sequence which is at least 50% 
1 5 identical to SEQ ID NO:6, and 

iii) a biologically active fragment of i) or ii), 

wherein the polypeptide regulates fibre initiation and/or elongation. 

In another aspect, the present invention provides a substantially purified 
polypeptide selected from the group consisting of: \ 

20 *) a polypeptide comprising an amino acid sequence as provided in SEQ ID 

NO:7, 

ii) a polypeptide comprising an amino acid sequence which is at least 79% 
identical to SEQ ID NO:7, and 

iii) a biologically active fragment of i) or ii), 

25 wherein the polypeptide regulates fibre initiation and/or elongation. 

In another aspect, the present invention provides a substantially purified 
polypeptide selected from the group consisting of: 

i) a polypeptide comprising an amino acid sequence as provided in SEQ ID 

NO:8, 

30 ii) a polypeptide comprising an amino acid sequence which is at least 66% 

identical to SEQ ID NO:8, and 

iii) a biologically active fragment of i) or ii), 
wherein the polypeptide regulates fibre initiation and/or elongation. 

In another aspect, the present invention provides a substantially purified 
35 polypeptide selected from the group consisting of: 

i) a polypeptide comprising an amino acid sequence as provided in SEQ ID 

NO:9, 
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ii) a polypeptide comprising an amino acid sequence which is at least 95% 
identical to SEQ ID NO:9, and 

iii) a biologically active fragment of i) or ii), 

wherein the polypeptide regulates fibre initiation and/or elongation. 

In another aspect, the present invention provides s substantially purified 
polypeptide selected from the group consisting of: 

i) a polypeptide comprising an amino acid sequence as provided in SEQ ID 
NO: 10, 

ii) a polypeptide comprising an amino acid sequence which is at least 67% 
identical to SEQ ID NO: 10, and 

iii) a biologically active fragment of i) or ii), 

wherein the polypeptide regulates fibre initiation and/or elongation. 

In another aspect, the present invention provides a substantially purified 
polypeptide selected from the group consisting of: 

i) a polypeptide comprising an amino acid sequence as provided in SEQ ID 
NO:ll, 

ii) a polypeptide comprising an amino acid sequence which is at least 55% 
identical to SEQ ID NO: 1 1 , and 

iii) a biologically active fragment of i) or ii), 

wherein the polypeptide regulates fibre initiation and/or elongation. 

In another aspect, the present invention provides a substantially purified 
polypeptide selected from the group consisting of: 

i) a polypeptide comprising an amino acid sequence as provided in SEQ ID 
NO: 12, 

ii) a polypeptide comprising an amino acid sequence which is at least 59% 
identical to SEQ ID NO: 12, and 

iii) a biologically active fragment of i) or ii), 

wherein the polypeptide regulates fibre initiation and/or elongation. 

In another aspect, the present invention provides a substantially purified 
polypeptide selected from the group consisting of: 

i) a polypeptide comprising an amino acid sequence as provided in SEQ ID 
NO: 13, 

ii) a polypeptide comprising an amino acid sequence which is at least 77% 
identical to SEQ ID NO: 13, and 

iii) a biologically active fragment of i) or ii), 

wherein the polypeptide regulates fibre initiation and/or elongation. 

In another aspect, the present invention provides a substantially purified 
polypeptide selected from the group consisting of: 



9 



i) a polypeptide comprising an amino acid sequence as provided in SEQ ID 
NO: 14, . 

ii) a polypeptide comprising an amino acid sequence which is at least 50% 
identical to SEQ ID NO: 14, and 

5 iii) a biologically active fragment of i) or ii), 

wherein the polypeptide regulates fibre initiation and/or elongation. 

In another aspect, the present invention provides a substantially purified 
polypeptide selected from the group consisting of: 

i) a polypeptide comprising an amino acid sequence as provided in SEQ ID 
10 NO: 15, 

ii) a polypeptide comprising an amino acid sequence which is at least 64% 
identical to SEQ ID NO: 1 5, and 

iii) a biologically active fragment of i) or ii), 

wherein the polypeptide regulates fibre initiation and/or elongation. 
15 In another aspect, the present invention provides a substantially purified 

polypeptide selected from the group consisting of: 

i) a polypeptide comprising an amino acid sequence as provided in SEQ ID 
NO: 16, 

ii) a polypeptide comprising an amino acid sequence which is at least 50% 
20 identical to SEQ ID NO: 1 6, and 

iii) a biologically active fragment of i) or ii), 

wherein the polypeptide regulates fibre initiation and/or elongation. 

With regard to the polypeptide aspects, it will be appreciated that % identity 
figures higher than those provided above will encompass preferred embodiments. Thus, 

25 where applicable, in light of the minimum % identity figures, it is preferred that the 
polypeptide comprises an amino acid sequence which is at least 60%, more preferably at 
least 65%, more preferably at least 70%, more preferably at least 75%, more preferably 
at least 76%, more preferably at least 80%, more preferably at least 85%, more 
preferably at least 90%, more preferably at least 91%, more preferably at least 92%, 

30 more preferably at least 93%, more preferably at least 94%, more preferably at least 
95%, more preferably at least 96%, more preferably at least 97%, more preferably at 
least 98%, more preferably at least 99%, more preferably at least 99.1%, more 
preferably at least 99.2%, more preferably at least 99.3%, more preferably at least 
99.4%, more preferably at least 99.5%, more preferably at least 99.6%, more preferably 

35 at least 99.7%, more preferably at least 99.8%, and even more preferably at least 99.9% 
identical to the relevant nominated SEQ ID NO. 

Preferably, the polypeptide can be purified from a species of the Genus 
Gossypium. ' 
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Preferably, the polypeptide is a fusion protein further comprising at least one 
other polypeptide sequence. 

In a preferred embodiment, the at least one other polypeptide is selected from the 
group consisting of: a polypeptide that enhances the stability of a polypeptide of the 
5 present invention, a polypeptide that assists in the purification of the fusion protein, and 
a polypeptide which assists in the polypeptide of the invention being secreted from a 
cell (particularly a plant cell). 

In another aspect, the present invention provides an isolated polynucleotide 
comprising a sequence of nucleotides selected from the group consisting of: 
10 i) a sequence of nucleotides as provided in SEQ ID NO: 17; 

ii) a sequence of nucleotides as provided in SEQ ID NO: 18; 

iii) a sequence encoding a polypeptide of the invention; 

iv) a sequence of nucleotides which is at least 87% identical to SEQ ID NO: 17 or 
SEQ ID NO: 18; and 

15 v) a sequence which hybridizes to any one of i) to iv) under high stringency 

conditions, 

wherein the polynucleotide does not comprise a sequence of nucleotides as 
provided in SEQ ID NO:46. 

In another aspect, the present invention provides an isolated polynucleotide 
20 comprising a sequence of nucleotides selected from the group consisting of: 

i) a sequence of nucleotides as provided in SEQ ID NO: 1 9, 

ii) a sequence of nucleotides as provided in SEQ ID NO:20, 

iii) a sequence encoding a polypeptide of the invention, and 

iv) a sequence complementary to any one of i) to iii). 

25 In another aspect, the. present invention provides an isolated polynucleotide 

comprising a sequence of nucleotides selected from the group consisting of: 

i) a sequence of nucleotides as provided in SEQ ID NO:21; 

ii) a sequence of nucleotides as provided in SEQ ID NO:22; 

iii) a sequence encoding a polypeptide of the invention; 

30 iv) a sequence of nucleotides which is at least 54% identical to SEQ ID NO:21 or 

SEQIDNO:22;and 

v) a sequence which hybridizes to any one of i) to iv) under high stringency 
conditions, 

wherein the polynucleotide does not comprise a sequence of nucleotides as 
35 provided in SEQ ID NO:47. 

In another aspect, the present invention provides an isolated polynucleotide 
comprising a sequence of nucleotides selected from the group consisting of: 
i) a sequence of nucleotides as provided in SEQ ID NO:23; 
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ii) a sequence encoding a polypeptide of the invention; 

iii) a sequence of nucleotides which is at least 55% identical to SEQ ID NO:23; 

and 

iv) a sequence which hybridizes to any one of i) to iii) under high stringency 
5 conditions, 

wherein the polynucleotide does not comprise a sequence of nucleotides as 
provided in SEQ ID NO:48. 

In another aspect, the present invention provides an isolated polynucleotide 
comprising a sequence of nucleotides selected from the group consisting of: 
10 i) a sequence of nucleotides as provided in SEQ ID NO:24; 

ii) a sequence of nucleotides as provided in SEQ ID NO:25; 

iii) a sequence encoding a polypeptide of the invention; 

iv) a sequence of nucleotides which is at least 50% identical to SEQ ID NO;24 or 
SEQIDNO:25;ancl 

15 v ) a sequence which hybridizes to any one of i)„to iv) under high stringency 

conditions, 

wherein the polynucleotide does not comprise a sequence of nucleotides as 
provided in SEQ ID NO:49. 

In another aspect, the present invention provides an isolated polynucleotide 
20 comprising a sequence of nucleotides selected from the group consisting of; 

i) a sequence of nucleotides as provided in SEQ ID NO:26; 

ii) a sequence of nucleotides as provided in SEQ ID NO:27; 

iii) a sequence encoding a polypeptide of the invention; 

iv) a sequence of nucleotides which is at least 50% identical to SEQ ID NO:26 or 
25 SEQ ID NO:27; and 

v) a sequence which hybridizes to any one of i) to iv) under high stringency 
conditions, 

wherein the polynucleotide does not comprise a sequence of nucleotides as 
provided in SEQ ID NO:50. 
30 In another aspect, the present invention provides an isolated polynucleotide 

comprising a sequence of nucleotides selected from the group consisting of: 

i) a sequence of nucleotides as provided in SEQ ID NO:28, 

ii) a sequence of nucleotides as provided in SEQ ID NO:29, 

iii) a sequence encoding a polypeptide of the invention, 
35 iv) a sequence complementary to any one of i) to iii). 

In another aspect, the present invention provides an isolated polynucleotide 
comprising a sequence of nucleotides selected from the group consisting of: 
i) a sequence of nucleotides as provided in SEQ ID NO:30; 
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ii) a sequence of nucleotides as provided in SEQ ID NO:31; 

iii) a sequence encoding a polypeptide of the invention; 

iv) a sequence of nucleotides which is at least 65% identical to SEQ ID NO:30 or 
SEQIDNO:31;ancl 

5 v) a sequence which hybridizes to any one of i) to iv) under high stringency 

conditions, 

wherein the polynucleotide does not comprise a sequence of nucleotides as 
provided in SEQ ID NO:5 1 . 

In another aspect, the present invention provides an isolated polynucleotide 
10 comprising a sequence of nucleotides selected from the group consisting of: 

i) a sequence of nucleotides as provided in SEQ ID NO:32, 

ii) a sequence of nucleotides as provided in SEQ ID NO:33, 

iii) a sequence encoding a polypeptide of the invention, 

iv) a sequence complementary to any one of i) to iii). 

15 In another aspect, the present invention provides an isolated polynucleotide 

comprising a sequence of nucleotides selected from the group consisting of: 

i) a sequence of nucleotides as provided in SEQ ID NO:34; 

ii) a sequence of nucleotides as provided in SEQ ID NO:35; 

iii) a sequence encoding a polypeptide of the invention; 

20 iv ) a sequence of nucleotides which is at least 70% identical to SEQ ID NO:34 or 

SEQ ID NO:35; ancl 

v) a sequence which hybridizes to any one of i) to iv) under high stringency 
conditions, 

wherein the polynucleotide does not comprise a sequence of nucleotides as 
25 provided in SEQ ID NO:52. 

In another aspect, the present invention provides an isolated polynucleotide 
comprising a sequence of nucleotides selected from the group consisting of: 

i) a sequence of nucleotides as provided in SEQ ID NO:36; 

ii) a sequence of nucleotides as provided in SEQ ID NO:37; 
30 iii) a sequence encoding a polypeptide of the invention; 

iv) a sequence of nucleotides which is at least 55% identical to SEQ ID NO:36 or 
SEQ ID NO:37; and 

v) a sequence which hybridizes to any one of i) to iv) under high stringency 
conditions, 

35 wherein the polynucleotide does not comprise a sequence of nucleotides as 

provided in SEQ ID NO:53. 

In another aspect, the present invention provides an isolated polynucleotide, 
comprising a sequence of nucleotides selected from the group consisting of: 
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i) a sequence of nucleotides as provided in SEQ ID NO:38; 

ii) a sequence encoding a polypeptide of the invention, 

iii) a sequence of nucleotides which is at least 65% identical to SEQ ID NO:38; 

and 

5 iv) a sequence which hybridizes to any one of i) to iii) under high stringency 

conditions. 

In another aspect, the present invention provides an isolated polynucleotide 
comprising a sequence of nucleotides selected from the group consisting of: 

i) a sequence of nucleotides as provided in SEQ ID NO:39; 
10 ii) a sequence of nucleotides as provided in SEQ ID NO:40; 

iii) a sequence encoding a polypeptide of the invention; 

iv) a sequence of nucleotides which is at least 95% identical to SEQ ID NO:39 or 
SEQIDNO:40;and 

v) a sequence which hybridizes to any one of i) to iv) under high stringency 
15 conditions, 

wherein the polynucleotide does not comprise a sequence' of nucleotides as 
provided in SEQ ID NO:54. 

In another aspect, the present invention provides an isolated polynucleotide 
comprising a sequence of nucleotides selected from the group consisting of: 
20 i) a sequence of nucleotides as provided in SEQ ID NO:41 ; 

ii) a sequence of nucleotides as provided in SEQ ID NO:42; 

iii) a sequence encoding a polypeptide of the invention; 

iv) a sequence of nucleotides which is at least 50% identical to SEQ ID NO:41 or 
SEQ ID NO:42; and 

25 v) a sequence which hybridizes to any one of i) to iv) under high stringency 

conditions, 

wherein the polynucleotide does not comprise a sequence of nucleotides as 
provided in SEQ ID NO:55. 

In another aspect, the present invention provides an isolated polynucleotide 
30 comprising a sequence of nucleotides selected from the group consisting of: 

i) a sequence of nucleotides as provided in SEQ ID NO:43; 

ii) a sequence of nucleotides as provided in SEQ ID NO:44; 

iii) a sequence encoding a polypeptide of the invention; 

iv) a sequence of nucleotides which is at least 65% identical to SEQ ID NO:43 or 
35 SEQ ID NO:44; and 

v) a sequence which hybridizes to any one of i) to iv) under high stringency 
conditions, 
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wherein the polynucleotide does not comprise a sequence of nucleotides as 
provided in SEQ ID NO:56. 

In another aspect, the present invention provides an isolated polynucleotide 
comprising a sequence of nucleotides selected from the group consisting of: 
5 i) a sequence of nucleotides as provided in SEQ ID NO:45; 

ii) a sequence encoding a polypeptide of the invention; 

iii) a sequence of nucleotides which is at least 50% identical to SEQ ID NO:45; 

and 

iv) a sequence which hybridizes to any one of i) to iii) under high stringency 
10 conditions, 

wherein the polynucleotide does not comprise a sequence of nucleotides as 
provided in SEQ ID NO:57. 

With regard to the polynucleotide aspects, it will be appreciated that % identity 
figures higher than those provided above will encompass preferred embodiments. Thus, 

15 where applicable, in light of the minimum % identity figures, it is preferred that the 
polynucleotide comprises a nucleotide sequence which is at least 60%, more preferably 
at least 65%, more preferably at least 70%, more preferably at least 75%, more 
preferably at least 76%, more preferably at least 80%, more preferably at least 85%, 
more preferably at least 90%, more preferably at least 91%, more preferably at least 

20 92%, more preferably at least 93%, more preferably at least 94%, more preferably at 
least 95%, more preferably at least 96%, more preferably at least 97%, more preferably 
at least 98%, more preferably at least 99%, more preferably at least 99.1%, more 
preferably at least 99.2%, more preferably at least 99.3%, more preferably at least 
99.4%, more preferably at least 99.5%, more preferably at least 99.6%, more preferably 

25 at least 99.7%, more preferably at least 99.8%, and even more preferably at least 99.9% 
identical to the relevant nominated SEQ ID NO. 

In a further aspect, the present invention provides a catalytic polynucleotide 
capable of cleaving a polynucleotide according to the invention. 
Preferably, the catalytic polynucleotide is a ribozyme. 

30 In yet another aspect, the present invention provides an oligonucleotide which 

comprises at least 19 contiguous nucleotides of a polynucleotide according to the 
invention. 

In another aspect, the present invention provides a double stranded RNA 
(dsRNA) molecule comprising an oligonucleotide according to the invention, wherein 
35 the portion of the molecule that is double stranded is at least 19 basepairs in length and 
comprises said oligonucleotide. 

Preferably, the dsRNA is expressed from a single promoter, wherein the strands 
of the double stranded portion are linked by a single stranded portion. 
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In a further aspect, the present invention provides a vector comprising or 
encoding the polynucleotide according to the invention. 

The vectors may be, for example, a plasmid, virus, transposon or phage vector 
provided with an origin of replication, and preferably a promotor for the expression of 
5 the polynucleotide and optionally a regulator of the promotor. The vector may contain 
one or more selectable markers, for example an ampicillin resistance gene in the case of 
a bacterial plasmid or a neomycin resistance gene for a mammalian expression vector. 
The vector may be used in vitro, for example for the production of RNA or used to 
transfect or transform a host cell. Preferably, the vector is capable of replication in a 
10 plant cell. 

Preferably, the polynucleotide is operably linked to a plant ovule or fibre specific 
promoter. 

In another aspect, the present invention provides a vector comprising or encoding 
oligonucleotide of the invention or the dsRNA molecule of the invention. 
15 In a further aspect, the present invention provides a host cell comprising a vector 

according to the invention. 

In a further aspect, the present invention provides a transgenic plant, the plant 
having been transformed with polynucleotide according to the invention or an 
oligonucleotide of the invention. 
20 In one embodiment, the polynucleotide is capable of expression to produce a 

polypeptide according to the invention. 

In an alternate embodiment, the plant has been transformed such that it produces 
a catalytic polynucleotide of the invention, or a dsRNA molecule of the invention. 

In a further embodiment, the polynucleotide, catalytic polynucleotide or dsRNA 
25 down-regulates the production of a polypeptide of the invention which is endogenously 
produced by the plant. 

In a further aspect, the present invention provides a substantially purified 
antibody, or fragment thereof, that specifically binds a polypeptide of the invention. 

In another aspect, the present invention provides a method of breeding a fibre 
30 producing plant, the method comprising performing a method according to the first, 
second or third aspects of the invention. 

In a further aspect, the present invention provides a method of selecting from a 
breeding population a fibre producing plant with altered fibre initiation and/or 
elongation potential, the method comprising; 
35 i) crossing two plants which have differing potential to produce fibre, 

ii) performing a method according to the first, second or third aspects of the 
invention on progeny plants, 
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iii) selecting a progeny plant with altered fibre initiation and/or elongation 
potential when compared to a parent plant. 

In another aspect, the present invention provides a plant produced by a method of 
the invention. 

In a further aspect, the present invention provides a seed of a plant, transgenic or 
otherwise, of the invention. 

In a further aspect, the present invention provides fibre of a plant, transgenic or 
otherwise, of the invention. 

In another aspect, the present invention provides a method of identifying an 
agent which alters fibre initiation and/or elongation of a fibre producing plant, the 
method comprising 

a) exposing a polypeptide which is at least 50% identical to any one of SEQ ID 
NO f s:l to 16 to a candidate agent, and 

b) assessing the ability of the candidate agent to modulate the activity of the 
polypeptide. 

In a further aspect, the present invention provides a method of identifying an 
agent which alters fibre initiation and/or elongation of a fibre producing plant, the 
method comprising 

a) exposing a polypeptide which is at least 50% identical to any one of SEQ ID 
NO'srl to 16 to a binding partner which binds the polypeptide, and a candidate agent, 
and 

b) assessing the ability of the candidate agent to compete with the binding 
partner for binding to the polypeptide. 

Preferably, the binding partner is detectably labeled. 

In a further aspect, the present invention provides a method of identifying ah 
agent which alters fibre initiation and/or elongation of a fibre producing plant, the 
method comprising 

a) exposing a polynucleotide encoding a polypeptide which is at least 50% 
identical to any one of SEQ ID NO's:l to 16 to a candidate agent under conditions 
which allow expression of the polynucleotide, and 

b) assessing the ability of the candidate agent to modulate levels of polypeptide 
produced by the polynucleotide. 

In another aspect, the present invention provides a method of identifying an 
agent which alters fibre initiation and/or elongation of a fibre producing plant, the 
method comprising 

a) exposing a polynucleotide which is at least 50% identical to any one of SEQ 
ID NO's:17 to 45 to a candidate agent, and 
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b) assessing the ability of the candidate agent to hybridize and/or cleave the 
polynucleotide. 

As will be apparent, preferred features and characteristics of one aspect of the 
invention are applicable to many other aspects of the invention. 

Throughout this specification the word "comprise", or variations such as 
"comprises" or "comprising", will be understood to imply the inclusion of a stated 
element, integer or step, or group of elements, integers or steps, but not the exclusion of 
any other element, integer or step, or group of elements, integers or steps. 

The invention is hereinafter described by way of the following non-limiting 
Examples and with reference to the accompanying figures. 

BRIEF DESCRIPTION O ff THE ACCOMPANYING DRAWINGS 
Figure 1. Expression profiles of the candidate genes measured by microarrays. The 
plots on the left side (column A) represent the results from the DP 16 time course 
experiment and the values plotted are the ratios relative to 0 dpa. The plots on the right 
side (column B) represent the results of multi-time point Lintless 4 A/DP 16 
comparisons and the values plotted are the ratios of Lintless 4A/DP16 at the 
corresponding dpa. The genes showing similar expression profiles are plotted together. 
Figure 2. RT PCR of GhMyb25 with 0-tubulin as control. 

a. Different tissues from DP 16. -Co: negative control, reaction without reverse 
transcriptase; O: 0 dpa ovule; P: petal; L: leaf; R: root; S: stem. 

b. Ovule and fibre from DP 16. The numbers indicate the corresponding dpa. 
c Ovule from 3 mutants. The numbers indicated the corresponding dpa. 

d. Southern blotting of the c. hybridised with an ON035F4 probe. 
Figure 3. RT-PCR of GhHDl with p-tubulin as control. -Co: negative control, reaction 
without reverse transcriptase; -4 to 4: DP16 ovules of various stages (dpa) as indicated 
by the corresponding number; 80: 8 dpa ovule of DP 16 after fibres being removed; 8F: 
8 dpa detached fibres of DP16; 0-5B: 0 dpa ovule form mutant 5B; 0-4A: 0 dpa ovule 
from mutant 4A; L: leaf; P: petal; R: root; H: hypocotyl. 

Figure 4. Histograms of relative DNA contents of ovule epidermal cells and fibre cells. 
KEY TO T HE SEQUENCE LISTING 

SEQ ID NO:l - Partial homeodomain like protein encoded by GhHDl cDNA (clone 
ON033M7). 

SEQ ID NO:2 - Myb transcription factor like protein encoded by GhMyb25 cDNA 
(clone ON035F4). 

SEQ ID NO:3 - Partial cyclin D3 like protein encoded by GhCycD3;l cDNA (clone 
OCF07F4). 
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SEQ ID NO:4 - Partial protein encoded by GhFaEl cDNA (clone ON035N9). 

SEQ ID NO:5 - Possible partial protein encoded by GhFUl cDNA (clone ON035C9). 

SEQ ID NO:6 - Possible partial protein encoded by GhFU2 cDNA (clone ON005F1). 

SEQ ID NO:7 - a-expansin like protein cDNA (encoded in part by clone Pfsl4x). 
5 SEQ ID NO:8 - Partial protein encoded by GhTMTP cDNA (clone CHX01 5K1 8). 

SEQ ID NO:9 - Sucrose synthase gene encoded by Ghsus cDNA (clone CHX002C10). 

SEQ ID.NO:10 - Partial protein encoded by GhLTP cDNA (clone ON033M19). 

SEQ ID NO:l 1 - Protein encoded by GhLTP2 cDNA (clone OCF010D8). 

SEQ ID NO: 12 - Partial protein encoded by GhMyb25-like cDNA (clone ON038N8). 
10 SEQ ID NO:13 - Protein encoded by GhRD22 cDNA (clone OCF005C10). 

SEQ ID NO: 14 - Protein encoded by GhRD22-like cDNA (clone OCF010D8). 

SEQ ID NO: 15 - Partial protein encoded by GhAsp cDNA (clone OCF008G9). 

SEQ ID NO: 16 - Partial protein encoded by cDNA clone CHX007D1 . 

SEQ ID NO: 17 - GhHDl cDNA (clone ON033M7). 
1 5 SEQ ID NO: 1 8 - Coding region of GhHD 1 cDNA (clone ON033M7). 

SEQ ID NO: 19 - GhMyb25 cDNA (clone ON035F4). 

SEQ ID NO:20 - Coding region for GhMyb25 cDNA (clone ON035F4). 

SEQ ID NO:21 - GhCycD3;l cDNA (clone OCF07F4). 

SEQ ID NO:22 - Coding region of GhCycD3;l cDNA (clone OCF07F4). 
20 SEQ ID NO:23 - GhFaEl cDNA (clone ON035N9) (entire clone coding region). 

SEQ ID NO:24 - GhFUl cDNA (clone ON035C9). 

SEQ ID NO:25 - Coding region of GhFUl cDNA (clone ON035C9). 

SEQ ID NO;26 - GhFU2 cDNA (clone ON005F1). 

SEQ ID NO:27 - Coding region of GhFU2 cDNA (clone ON005F1). 
25 SEQ ID NO:28 - a-expansin like clone Pfsl4x. 

SEQ ID NO:29 - Coding region of a-expansin like clone PfsHx. 

SEQ ID NO:30 - GhTMTP cDNA (clone CHX015K18). 

SEQ ID NO:31 - Coding region for GhTMTP cDNA (clone CHX015K1 8). 

SEQ ID NO:32 - Ghsus cDNA (clone CHX002C10). 
30 SEQ ID NO:33 - Coding region of Ghsus cDNA (clone CHX002C10). 

SEQ ID NO:34 - GhLTP cDNA (clone ON033M19). 

SEQ ID NO:35 - Coding region for GhLTP cDNA (clone ON033M19). 

SEQ ID NO:36 - GhLTP2 CDNA (clone OCF01 0D8). 

SEQ ID NO:37 - Coding region for GhLTP2 cDNA (clone OCF010D8). 
3 5 SEQ ID NO:3 8 - GhMyb25-like cDNA (clone ON03 8N8) (entire clone coding region). 

SEQ ID NO:39 - GhRD22 cDNA (plone OCF005C10). 

SEQ ID NO:40 - Coding region for GhRD22 cDNA (clone OCF005C10). 

SEQ ID NO:41 - GhRD22-like cDNA (clone OCF010D8). 
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SEQ ID NO:42 - Coding region for GhRD22-like cDNA (clone OCF010D8). 

SEQ ID NO:43 - GhAsp cDNA (clone OCF008G9). 

SEQ ID NO:44 - Coding region for GhAsp cDNA (clone OCF008G9). 

SEQ ID NO:45 - cDNA clone CHX007D 1 (entire clone coding region). 

SEQ ID NO:46 - Cotton EST BE052193. 

SEQ ID NO:47 - Cotton EST BQ412597. 

SEQ ID NO:48 - Cotton EST AI73 1943. 

SEQ ID NO:49 - Cotton EST BG442467. 

SEQ ID NO:50 - Cotton EST BQ403714. 

SEQ ID NO:51 - Cotton EST BG443329. 

SEQ ID NO:52 - Cotton EST BF275 1 77. 

SEQ ID NO:53 - Cotton EST BQ410140. 

SEQ ID NO:54 - Cotton EST CA993037, 

SEQ ID NO:55 - Cotton EST BG441493. 

SEQ ID NO:56 - Cotton EST BQ402375. 

SEQ ID NO:57 - Cotton EST BQ413582. 

SEQ ID NO's:58 to 64 - Stem loop sequences of dsRNA molecules. 
SEQ ID NO's:65 to 72 - Oligonucleotide primers. 

DETAILED DESCRIPTION OF THE INVENTION 

General Techniques and Definitions 

Unless specifically defined otherwise, all technical and scientific terms used 
herein shall be taken to have the same meaning as commonly understood by one of 
ordinary skill in the art (e.g., in cell culture, molecular genetics, plant physiology and 
biochemistry, immunohistochemistry, protein chemistry, and biochemistry). 

Unless otherwise indicated, the recombinant protein, cell culture and others 
methods utilized in the present invention are standard procedures, well known to those 
skilled in the art. Such techniques are described and explained throughout the literature 
in sources such as, J. Perbal, A Practical Guide to Molecular Cloning, John Wiley and 
Sons (1984), J. Sambrook et al., Molecular Cloning: A Laboratory Manual, Cold Spring 
Harbour Laboratory Press (1989), TA. Brown (editor), Essential Molecular Biology: A 
Practical Approach, Volumes 1 and 2, IRL Press (1991), D.M. Glover and B.D. Hames 
(editors), DNA Cloning: A Practical Approach, Volumes 1-4, IRL Press (1995 and 
1996), and F.M. Ausubel et al., (editors), Current Protocols in Molecular Biology, 
Greene Pub. Associates and Wiley-Interscience (1988, including all updates until 
present), Ed Harlow and David Lane (editors) Antibodies: A Laboratory Manual, Cold 
Spring Harbour Laboratory, (1988), and J.E. Coligan et al., (editors) Current Protocols 



20 



in Immunology, John Wiley & Sons (including all updates until present), and are 
incorporated herein by reference. 

As used herein, the term "gene" is to be taken in its broadest context and includes 
the deoxyribonucleotide sequences comprising the protein coding region of a structural 
gene and including sequences located adjacent to the coding region on both the 5' and 3' 
ends for a distance of about 1 kb on either end such that the gene corresponds to the 
length of the full-length mRNA. The sequences which are located 5' of the coding 
region and which are present on the mRNA are referred to as 5' non-translated 
sequences. The sequences which are located 3' or downstream of the coding region and 
which are present on the mRNA are referred to as 3 1 non-translated sequences; these 
sequences. The term "gene" encompasses both cDNA and genomic fomis of a gene. A 
genomic form or clone of a gene contains the coding region which may be interrupted 
with non-coding sequences termed "introns" or "intervening regions" or "intervening 
sequences." Introns are segments of a gene which are transcribed into nuclear RNA 
(hnRNA); introns may contain regulatory elements such as enhancers. Introns are 
removed or "spliced out" from the nuclear or primary transcript; introns therefore are 
absent in the messenger RNA (mRNA) transcript. The mRNA functions during 
translation to specify the sequence or order of amino acids in a nascent polypeptide. 
The term "gene" includes a synthetic or fusion molecule encoding all or part of the 
proteins of the invention described herein and a complementary nucleotide sequence to 
any one of the above. 

A "polymorphism" as used herein denotes a variation in the nucleotide sequence 
between genes of the invention, of different species, cultivars, strains or individuals of a 
plant. A "polymorphic position" is a preselected nucleotide position within the 
sequence of the gene. In some cases, genetic polymorphisms are reflected by an amino 
acid sequence variation, and thus a polymorphic position can result in location of a 
polymorphism in the amino acid sequence at a predetermined position in the sequence 
of a polypeptide. Typical polymorphisms are deletions, insertions or substitutions. 
These can involve a single nucleotide (single nucleotide polymorphism or SNP) or two 
or more nucleotides. 

A "deletion," as used herein, refers to a change in either amino acid or nucleotide 
sequence in which one or more amino acid or nucleotide residues, respectively, are 
absent. 

An "insertion" or "addition," as used herein, refers to a change in an amino acid 
or nucleotide sequence resulting in the addition of one or more amino acid or nucleotide 
residues, respectively. 

A "substitution," as used herein, refers to the replacement of one or more amino 
acids or nucleotides by different amino acids or nucleotides, respectively. 
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By "linked" or "genetically linked" it is meant that a marker locus and a second 
locus are sufficiently close on a chromosome that they will be inherited together in more 
than 50% of meioses, e.g., not randomly. Thus, the percent of recombination observed 
between the loci per generation (centimorgans (cM)), will be less than 50. In particular 
embodiments of the invention, genetically linked loci may be 45, 35, 25, 15, 10, 5, 4, 3, 
2, or 1 or less cM apart on a chromosome. Preferably, the markers are less than 5 cM 
apart and most preferably about 0 cM apart. 

The term "fibre" refers to plant cell types that share in common the features of 
having an elongated shape and abundant cellulose in thick cell walls, usually, but not 
always, described as secondary walls. Such walls may or may not be lignified, and the 
protoplast of such cells may or may not remain alive at maturity. Here the term "fibre" 
is used in its most inclusive sense, for example including: (a) thick-walled conducting 
and non-conducting cells of the xylem; (b) fibres of extraxylary origin, including those 
from ovary including the outer integument of the ovary, phloem, bark, ground tissue, 
and epidermis; and (c) fibres from stems, leaves, roots, seeds, and flowers or 
inflorescences. Such fibres have many industrial uses, for example in textiles, paper, 
sacking and boxing material, cordage, brushes and brooms, filling and stuffing, 
caulking, reinforcement of other materials, and manufacture of cellulose derivatives. In 
some industries, the term "fibre" is usually inclusive of thick-walled conducting cells 
such as vessels and tracheids and to fibrillar aggregates of many individual fibre cells. 
In a preferred embodiment, cotton fibre refers to the lint produced from the cotton boll 
(seed capsule) that is produced commercially. Cotton fibre also includes the short (about 
several mm) fibres sometimes referred to as "fuzz fibres". Preferred fibre producing 
plants include, but are not limited to, cotton (such as Gossypium arboreum, Gossypium 
herbaceum, Gossypium barbadense and Gossypium hirsutum), silk cotton tree (Kapok, 
Ceiba pentandra), desert willow, creosote bush, winterfal, balsa, ramie, kenaf, hemp 
(Cannabis sativa), roselle, jute, sisal abaca, flax, and horticultural plants such as grape, 
peach, pear, and apple. 

As used herein, a "wild-type plant" is a plant that has not been altered by a 
method of the invention and/or does not comprise a transgene of the invention. When 
performing a method of the invention for altering fibre initiation and/or elongation, the 
manipulated plant is compared to a non-manipulated ("wild-type") member of the same 
species to determine the impact of the manipulation on fibre initiation and/or elongation. 

In one embodiment, the term "altering fibre initiation and/or elongation" refers to 
increasing the number and/or size of the fibres. In another embodiment, the term 
"altering fibre initiation and/or elongation" refers to decreasing the number and/or size 
of the fibres. In a further embodiment, the term "altering fibre initiation and/or 
elongation" refers to modifying the timing of fibre initiation and/or elongation during 
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development of the plant, for example to promote earlier or delayed initiation, or to 
regulate the synchrony of fibre initiation. 

Detection of impaired genes and/or gene expression levels 
5 Any molecular biological technique known in the art which is capable of 

detecting a polymoiphism/mutation/genetic variation or differential gene expression can 
be used in the methods of the present invention. Such methods include, but are not 
limited to, the use of nucleic acid amplification, nucleic acid sequencing, nucleic acid 
hybridization with suitably labeled probes, single-strand conformational analysis 

10 (SSCA), denaturing gradient gel electrophoresis (DGGE), heteroduplex analysis (HET), 
chemical cleavage analysis (CCM), catalytic nucleic acid cleavage, or a combination 
thereof (see, for example, Lemieux, 2000), The invention also includes the use of 
molecular marker techniques to detect polymorphisms closely linked to genes of the 
invention. Such methods include the detection or analysis of restriction fragment length 

15 polymorphisms (RFLP), RAPD, amplified fragment length polymoiphisms (AFLP) and 
microsatellite (simple sequence repeat, SSR) polymorphisms. The closely linked 
markers can be obtained readily by methods well known in the art, such as Bulked 
Segregant Analysis. 

The "polymerase chain reaction" ("PGR") is a reaction in which replicate copies 

20 are made of a target polynucleotide using a "pair of primers" or "set of primers" 
consisting of "upstream" and a "downstream" primer, and a catalyst of polymerization, 
such as a DNA polymerase, and typically a thermally-stable polymerase enzyme. 
Methods for PCR are known in the art, and are taught, for example, in "PGR" (Ed. M. J. 
McPherson and S.G Moller (2000) BIOS Scientific Publishers Ltd, Oxford). PCR can 

25 be performed on cDNA obtained from reverse transcribing mRNA isolated from plant 
cells expressing, or that should be expressing, a gene of the invention. However, it will 
generally be easier if PCR is performed on genomic DNA isolated from a plant. 

A primer is an oligonucleotide, usually of about 20 nucleotides long, with a 
minimum of about 15 and a maximum of about 50 nucleotides, that is capable of 

30 hybridising in a sequence specific fashion to the target sequence and being extended 
during the PGR. Amplicons or PCR products or PCR fragments or amplification 
products are extension products that comprise the primer and the newly synthesized 
copies of the target sequences. Multiplex PCR systems contain multiple sets of primers 
that result in simultaneous production of more than one amplicon. Primers may be 

35 perfectly matched to the target sequence or they may contain internal mismatched bases 
that can result in the induction of restriction enzyme or catalytic nucleic acid 
recognition/cleavage sites in specific target sequences. Primers may also contain 
additional sequences and/or modified or labelled nucleotides to facilitate capture or 
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detection of amplicons. Repeated cycles of heat denaturation of the DNA, annealing of 
primers to their complementary sequences and extension of the annealed primers with 
polymerase result in exponential amplification of the target sequence. The terms target 
or target sequence or template refer to nucleic acid sequences which are amplified. 
5 Methods for direct sequencing of nucleotide sequences are well known to those 

skilled in the art and can be found for example in Ausubel et al., eds., Short Protocols in 
Molecular Biology, 3rd ed., Wiley, (1995) and Sambrook et al, Molecular Cloning, 2nd 
ed., Chap. 13, Cold Spring Harbor Laboratory Press, (1989). Sequencing can be carried 
out by any suitable method, for example, dideoxy sequencing, chemical sequencing or 

10 variations thereof. Direct sequencing has the advantage of determining variation in any 
base pair of a particular sequence, 

Hybridization based detection systems include, but are not limited to, the 
TaqMan assay and molecular beacons. The TaqMan assay (US 5,962,233) uses allele 
specific (ASO) probes with a donor dye on one end and an acceptor dye on the other 

15 end such that the dye pair interact via fluorescence resonance energy transfer (FRET). 
A target sequence is amplified by PCR modified to include the addition of the labeled 
ASO probe. The PCR conditions are adjusted so that a single nucleotide difference will 
effect binding of the probe. Due to the 5' nuclease activity of the Taq polymerase 
enzyme, a perfectly complementary probe is cleaved during PCR while a probe with a 

20 single mismatched base is not cleaved. Cleavage of the probe dissociates the donor dye 
from the quenching acceptor dye, greatly increasing the donor fluorescence. 

An alternative to the TaqMan assay is the molecular beacon assay (US 
5,925,517). In the molecular beacon assay, the ASO probes contain complementary 
sequences flanking the target specific species so that a hairpin structure is formed. The 

25 loop of the hairpin is complimentary to the target sequence while each arm of the 
hairpin contains either donor or acceptor dyes. When not hybridized to a donor 
sequence, the hairpin structure brings the donor and acceptor dye close together thereby 
extinguishing the donor fluorescence. When hybridized to the specific target sequence, 
however, the donor and acceptor dyes are separated with an increase in fluorescence of 

30 up to 900 fold. Molecular beacons can be used in conjunction with amplification of the 
target sequence by PCR and provide a method for real time detection of the presence of 
target sequences or can be used after amplification. 

Marker assisted selection is a well recognised method of selecting for 
heterozygous plants required when backcrossing with a recurrent parent in a classical 

35 breeding program. The population of plants in each backcross generation will be 
heterozygous for the gene of interest, normally present in a 1:1 ratio in a backcross 
population, and the molecular marker can be used to distinguish the two alleles. By 
extracting DNA from, for example, young leaves or shoots and testing with a specific 
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marker for the introgressed desirable trait, early selection of plants for further 
backcrossing is made whilst energy and resources are concentrated on fewer plants. 

Polypeptides 

5 By "substantially purified polypeptide" we mean a polypeptide that has been at 

least partially separated from the lipids, nucleic acids, other polypeptides, and other 
contaminating molecules with which it is associated in its native state. Preferably, the 
substantially purified polypeptide is at least 60% free, preferably at least 75% free, and 
most preferably at least 90% free from other components with which they are naturally 
10 associated. Furthermore, the term "polypeptide" is used interchangeably herein with the 
term "protein". 

The % identity of a polypeptide is determined by GAP (Needleman and Wunsch, 
1970) analysis (GCG program) with a gap creation penalty=5, and a gap extension 
penal ty=0.3. Unless stated otherwise, the query sequence is at least 15 amino acids in 

15 length, and the GAP analysis aligns the two sequences over a region of at least 15 amino 
acids. More preferably, the query sequence is at least 50 amino acids in length, and the 
GAP analysis aligns the two sequences over a region of at least 50 amino acids. Even 
more preferably, the query sequence is at least 100 amino acids in length and the GAP 
analysis aligns the two sequences over a region of at least 100 amino acids. 

20 Amino acid sequence mutants of the polypeptides of the present invention can be 

prepared by introducing appropriate nucleotide changes into a nucleic acid of the 
present invention, or by in vitro synthesis of the desired polypeptide. Such mutants 
include, for example, deletions, insertions or substitutions of residues within the amino 
acid sequence. A combination of deletion, insertion and substitution can be made to 

25 arrive at the final construct, provided that the final protein product possesses the desired 
characteristics. 

In designing amino acid sequence mutants, the location of the mutation site and 
the nature of the mutation will depend on characteristic(s) to be modified. The sites for 
mutation can be modified individually or in series, e.g., by (1) substituting first with 

30 conservative amino acid choices and then with more radical selections depending upon 
the results achieved, (2) deleting the target residue, or (3) inserting other residues 
adjacent to the located site, 

Amino acid sequence deletions generally range from about 1 to 30 residues, 
more preferably about 1 to 10 residues and typically about 1 to 5 contiguous residues. 

35 Substitution mutants have at least one amino acid residue in the polypeptide 

molecule removed and a different residue inserted in its place. The sites of greatest 
interest for substitutional mutagenesis include sites identified as the active or binding 
site(s). Other sites of interest are those in which particular residues obtained from 
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various strains or species are identical. These positions may be important for biological 
activity. These sites, especially those falling within a sequence of at least three other 
identically conserved sites, are preferably substituted in a relatively conservative 
manner. Such conservative substitutions are shown in Table 1 . 

5 

TABLE 1 . Exemplary substitutions. 



Original 


Exemplary 


Residue 


Substitutions 


Ala (A) 


val; leu; ile; gly 


Arg(R) 


lys 


Asn (N) 


gin; his 


Asp (D) 


glu 


Cys (C) 


ser 


Gln(Q) 


asn; his 


Glu (E) 


asp 


Gly (G) 


pro, ala 


TTJ ~ /TT\ 

rlis (hi) 


asn; gin 


He (I) 


leu; val; ala 


Leu (L) 


ile; val; met; ala; phe 


Lys (K) 


arg 


Met (M) 


leu; phe 


Phe (F) 


leu; val; ala 


Pro (P) 


gly 


Ser (S) 


thr 


Thr(T) 


ser 


Trp(W) 


tyr 


Tyr (Y) 


trp; phe 


Val(V) 


ile; leu; met; phe, ala 



Furthermore, if desired, unnatural amino acids or chemical amino acid analogues 
10 can be introduced as a substitution or addition into the polypeptides of the present 
invention. Such amino acids include, but are not limited to, the D-isomers of the 
common amino acids,. 2,4-diaminobutyric acid, a-amino isobutyric acid, 4-aminobutyric 
acid, 2-aminobutyric acid, 6-amino hexanoic acid, 2-amino isobutyric acid, 3-amino 
propionic acid, ornithine, norleucine, norvaline, hydroxyproline, sarcosine, citrulline, 
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homocitrulline, cysteic acid, t-butylglycine, t-butylalanine, phenylglycine, 
cyclohexylalanine, p -alanine, fluoro-amino acids, designer amino acids such as p~ 
methyl amino acids, Ca-methyl amino acids, Na-methyl amino acids, and amino acid 
analogues in general. 

5 Also included within the scope of the invention are polypeptides of the present 

invention which are differentially modified during or after synthesis, e.g., by 
biotinylation, benzylation, glycosylation, acetylation, phosphorylation, amidation, 
derivatization by known protecting/blocking groups, proteolytic cleavage, linkage to an 
antibody molecule or other cellular ligand, etc. These modifications may serve to 

1 0 increase the stability and/or bioactivity of the polypeptide of the invention. 

Polypeptides of the present invention can be produced in a variety of ways, 
including production and recovery of natural proteins, production and recovery of 
recombinant proteins, and chemical synthesis of the proteins. In one embodiment, an 
isolated polypeptide of the present invention is produced by culturing a cell capable of 

15 expressing the polypeptide under conditions effective to produce the polypeptide, and 
recovering the polypeptide. A preferred cell to culture is a recombinant cell of the 
present invention. Effective culture conditions include, but are not limited to, effective 
media, bioreactor, temperature, pH and oxygen conditions that permit protein 
production. An effective medium refers to any medium in which a cell is cultured to 

20 produce a polypeptide of the present invention. Such medium typically comprises an 
aqueous medium having assimilable carbon, nitrogen and phosphate sources, and 
appropriate salts, minerals, metals and other nutrients, such as vitamins. Cells of the 
present invention can be cultured in conventional fermentation bioreactors, shake flasks, 
test tubes, microtiter dishes, and petri plates. Culturing can be carried out at a 

25 temperature, pH and oxygen content appropriate for a recombinant cell. Such culturing 
conditions are within the expertise of one of ordinary skill in the art. 

In a preferred embodiment, the polypeptides of the invention are transcription 
factors, regulatory proteins, or proteins that regulate the cell-cycle in the fibre producing 
plant. The transcription factors may be Myb transcription factors or homeodomain 

30 containing transcription factors, which are classes well known in the art. 

Polynucleotides 

By an "isolated polynucleotide", including DNA, RNA, or a combination of 
these, single or double stranded, in the sense or antisense orientation or a combination of 
35 both, dsRNA or otherwise, we mean a polynucleotide which is at least partially 
separated from the polynucleotide sequences with which it is associated or linked in its 
. native state. Preferably, the isolated polynucleotide is at least 60% free, preferably at 
least 75% free, and most preferably at least 90% free from other components with which 
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they are naturally associated. Furthermore, the term "polynucleotide" is used 
interchangeably herein with the term "nucleic acid molecule". 

The % identity of a polynucleotide is determined by GAP (Needleman and 
Wunsch, 1970) analysis (GCG program) with a gap creation penalty=5, and a gap 
5 extension penalty=0.3. Unless stated otherwise, the query sequence is at least 45 
nucleotides in length, and the GAP analysis aligns the two sequences over a region of at 
least 45 nucleotides. Preferably, the query sequence is at least 150 nucleotides in 
length, and the GAP analysis aligns the two sequences over a region of at least 150 
nucleotides. More preferably, the query sequence is at least 300 nucleotides in length 

1 0 and the GAP analysis aligns the two sequences over a region of at least 300 nucleotides. 

Oligonucleotides of the present invention can be RNA, DNA, or derivatives of 
either. The minimum size of such oligonucleotides is the size required for the formation 
of a stable hybrid between an oligonucleotide and a complementary sequence on a 
nucleic acid molecule of the present invention. Preferably, the oligonucleotides are at 

15 least 15 nucleotides, more preferably at least 18 nucleotides, more preferably at least 19 
nucleotides, more preferably at least 20 nucleotides, even more preferably at least 25 
nucleotides in length. The present invention includes oligonucleotides that can be used 
as, for example, probes to identify nucleic acid molecules, primers to produce nucleic 
acid molecules or as agents to modify fibre initiation and/or elongation (e.g., as 

20 antisense-, triplex formation-, ribozyme- and/or RNA drug-based reagents). 
Oligonucleotide of the present invention used as a probe are typically conjugated with a 
label such as a radioisotope, an enzyme, biotin, a fluorescent molecule or a 
chemiluminescent molecule. 

Polynucleotides and oligonucleotides of the present invention include those 

25 . which hybridize under stringent conditions to a sequence provided as SEQ ID NO's: 17 
to 45. As used herein, stringent conditions are those that (1) employ low ionic strength 
and high temperature, for washing, for example, 0.015 M NaCl/0.0015 M sodium 
citrate/0.1% NaDodS0 4 at 50°C; (2) employ during hybridisation a denaturing agent 
such as formamide, for example, 50% (vol/vol) formamide with 0.1% bovine serum 

30 albumin, 0.1% Ficoll, 0.1% polyvinylpyrrolidone, 50 mM sodium phosphate buffer at 
pH 6.5 with 750 mM NaCl, 75 mM sodium citrate at 42°C; or (3) employ 50% 
formamide, 5 x SSC (0.75 M NaCl, 0.075 M sodium citrate), 50 mM sodium phosphate 
(pH 6.8), 0.1% sodium pyrophosphate, 5 x Denhardt's solution, sonicated salmon sperm 
DNA (50 g/ml), 0.1% SDS and 10% dextran sulfate at 42°C in 0.2 x SSC and 0.1% 

35 SDS. 

Polynucleotides of the present invention may possess, when compared to 
naturally occurring molecules, one or more mutations which are deletions, insertions, or 
substitutions of nucleotide residues. Mutants can be either naturally occurring (that is to 
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say, isolated from a natural source) or synthetic (for example, by performing site- 
directed mutagenesis on the nucleic acid). 

Antisense Polynucleotides 
5 The term "antisense nucleic acid" shall be taken to mean DNA or RNA, or 

combination thereof, molecule that is complementary to at least a portion of a specific 
mRNA molecule of the invention and capable of interfering with a post-transcriptional 
event such as mRNA translation. . The use of antisense methods is well known in the 
art (see for example, G. Hartmann and S. Endres, Manual of Antisense Methodology, 
1 0 Kluwer (1 999)). The use of antisense techniques in plants has been reviewed by 

Bourque (1995) and Senior (1998). Bourque lists a large number of examples of how 
antisense sequences have been utilized in plant systems as a method of gene 
inactivation. She also states that attaining 100% inhibition of any enzyme activity may 
not be necessary as partial inhibition will more than likely result in measurable change 

15 in the system. Senior (1 998) states that antisense methods are now a very well 
established technique for manipulating gene expression. 

Antisense molecules may include sequences that correspond to the structural 
genes or for sequences that effect control over the gene expression or splicing event. 
For example, the antisense sequence may correspond to the targeted coding region of 

20 the genes of the invention, or the 5 ' -untranslated region (UTR) or the 3 '-UTR or 

combination of these. It may be complementary in part to intron sequences, which may 
be spliced out during or after transcription, preferably only to exon sequences of the 
target gene. In view of the generally greater divergence of the UTRs, targeting these 
regions provides greater specificity of gene inhibition. The length of the antisense 

25 sequence should be at least 19 contiguous nucleotides, preferably at least 50 nucleotides, 
and more preferably at least 100, 200, 500 or 1000 nucleotides. The full-length sequence 
complementary to the entire gene transcript may be used. The length is most preferably 
100-2000 nucleotides. The degree of homology of the antisense sequence to the 
targeted transcript should be at least 85%, preferably at least 90% and more preferably 

30 95-1 00%. The antisense RNA molecule may of course comprise unrelated sequences 
which may function to stabilize the molecule. 

Catalytic Polynucleotides 

The term catalytic polynucleotide/nucleic acid refers to a DNA molecule or 
35 DNA-containing molecule (also known in the art as a "deoxyribozyme") or an RNA or 
RNA-containing molecule (also known as a "ribozyme") which specifically recognizes a 
distinct substrate and catalyzes the chemical modification of this substrate. The nucleic 
acid bases in the catalytic nucleic acid can be bases A, C, G, T (and U for RNA). 
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Typically, the catalytic nucleic acid contains an antisense sequence for specific 
recognition of a target nucleic acid, and a nucleic acid cleaving enzymatic activity (also 
referred to herein as the "catalytic domain"). The types of ribozymes that are 
particularly useful in this invention are the hammerhead ribozyme (Haseloff and 
5 Gerlach, 1988, Perriman et al., 1 992) and the hairpin ribozyme (Shippy et al., 1 999). 

The ribozymes of this invention and DNA encoding the ribozymes can be 
chemically synthesized using methods well known in the art. The ribozymes can also 
be prepared from a DNA molecule (that upon transcription, yields an RNA molecule) 
operably linked to an RNA polymerase promoter, e.g., the promoter for T7 RNA 

10 polymerase or SP6 RNA polymerase. Accordingly, also provided by this invention is a 
nucleic acid molecule, i.e., DNA or cDNA, coding for the ribozymes of this invention. 
When the vector also contains an RNA polymerase promoter operably linked to the 
DNA molecule, the ribozyme can be produced in vitro upon incubation with RNA 
polymerase and nucleotides. In a separate embodiment, the DNA can be inserted into 

15 an expression cassette or transcription cassette. After synthesis, the RNA molecule can 
be modified by ligation to a DNA molecule having the ability to stabilize the ribozyme 
and make it resistant to RNase. 

RNA interference 

20 RNA interference (RNAi) is particularly useful for specifically inhibiting the 

production of a particular protein. Although not wishing to be limited by theory, 
Waterhouse et al. (1998) have provided a model for the mechanism by which dsRNA 
can be used to reduce protein production. . This technology relies on the presence of 
dsRNA molecules that contain a sequence that is essentially identical to the mRNA of 

25 the gene of interest or part thereof, in this case an mRNA encoding a polypeptide 

according to the invention. Conveniently, the dsRNA can be produced from a single 
promoter in a recombinant vector or host cell, where the sense and anti-sense sequences 
are flanked by an unrelated sequence which enables the sense and anti-sense sequences 
to hybridize to form the dsRNA molecule with the unrelated sequence forming a loop 

30 structure. The design and production of suitable dsRNA molecules for the present 
invention is well within the capacity of a person skilled in the art, particularly 
considering Waterhouse et al. (1998), Smith et al. (2000), WO 99/32619, WO 99/53050, 
WO 99/49029, and WO 01/34815. In this method a DNA is introduced that directs the 
synthesis of an at least partly double stranded RNA product(s) with homology to the 

35 target gene to be inactivated. The DNA therefore comprises both sense and antisense 
sequences that, when transcribed into RNA, can hybridize to form the double-stranded 
RNA region. In a preferred embodiment, the sense and antisense sequences are 
separated by a spacer region that comprises an intron which, when transcribed into 
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RNA, is spliced out. This arrangement has been shown to result in a higher efficiency 
of gene silencing. The double-stranded region may comprise one or two RNA 
molecules, transcribed from either one DNA region or two. The presence of the double 
stranded molecule is thought to trigger a response from an endogenous plant system that 
5 destroys both the double stranded RNA and also the homologous RNA transcript from 
the target plant gene, efficiently reducing or eliminating the activity of the target gene. 
The length of the sense and antisense sequences that hybridise should each be at least 19 
contiguous nucleotides, preferably at least 30 or 50 nucleotides, and more preferably at 
least 100, 200, 500 or 1000 nucleotides. The full-length sequence corresponding to the 

1 0 entire gene transcript may be used. The lengths are most preferably 1 00-2000 

nucleotides. The degree of homology of the sense and antisense sequences to the 
targeted transcript should be at least 85%, preferably at least 90% and more preferably 
95-100%. The RNA molecule may of course comprise unrelated sequences which may 
function to stabilize the molecule. The RNA molecule may be expressed under the 

1 5 control of a RNA polymerase II. or RNA polymerase III promoter. Examples of the 
latter include tRNA or snRNA promoters. 

Preferred small interfering RNA ('siRNA") molecules comprise a nucleotide 
sequence that is identical to about 19-21 contiguous nucleotides of the target mRNA. 
Preferably, the target mRNA sequence commences with the dinucleotide AA, comprises 

20 a GC-content of about 30-70% (preferably, 30-60%, more preferably 40-60% and more 
preferably about 45%-55%), and does not have a high percentage identity to any 
nucleotide sequence other than the target in the genome of the plant (preferably cotton) 
in which it is to be introduced, e.g., as determined by standard BLAST search. 

Preferred loop ("single stranded") sequences are selected from, but not limited to, 

25 the group consisting of: 

(i) CCC (SEQ ID NO:58); 

(ii) UUCG(SEQIDNO:59); 

(iii) CCACC (SEQ ID NO:60); 

(iv) CUCGAG (SEQ ID NO:61); 
30 (v) AAGCUU(SEQIDNO:62); 

(vi) CCACACC (SEQ ID NO:63); and 

(vii) UUCAAGAGA (SEQ ID NO:64). 

Another molecular biological approach that may be used is co-suppression. The 
mechanism of co-suppression is not well understood but is thought to involve post- 
35 transcriptional gene silencing (PTGS) and in that regard may be very similar to many 
examples of antisense suppression. It involves introducing an extra copy of a gene or a 
fragment thereof into a plant in the sense orientation with respect to a promoter for its 
expression. The size of the sense fragment, its correspondence to target gene regions, 
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and its degree of homology to the target gene are as for the antisense sequences 
described above. In some instances the additional copy of the gene sequence interferes 
with the expression of the target plant gene. Reference is made to Patent specification 
WO 97/20936 and European patent specification 0465572 for methods of implementing 
5 co-suppression approaches. The antisense, cosuppression or double stranded RNA 
molecules may also comprise a largely double-stranded RNA region, preferably 
comprising a nuclear localization signal, as described in PCT/AU03/00292. In a 
preferred embodiment, the largely double-stranded region is derived from a PSTVd type 
viroid or comprises at least 35 CUG trinucleotide repeats. 

10 

Transgenic Plants 

The term "plant" refers to whole plants, plant organs (e.g. leaves, stems roots, 
etc), seeds, plant cells and the like. Plants contemplated for use in the practice of the 
present invention include both monocotyledons and dicotyledons. Preferably, the plant 
15 is a horticultural plant or cotton. 

The term "cotton" as used herein includes any species of the genus Gossypium 
which is used for commercial, fibre production, preferably G. hirsutum or G. 
barbadense. 

Transgenic plants, as defined in the context of the present invention include 
20 plants (as well as parts and cells 9f said plants) and their progeny which have been 
genetically modified using recombinant techniques. This would generally be to either i) 
cause or enhance production of at least one protein of the present invention in the 
desired plant or plant organ, or ii) disrupt the production and/or activity of a polypeptide 
of the present invention. Transformed plants contain genetic material that they did not 
25 contain prior to the transformation. The genetic material is preferably stably integrated 
into the genome of the plant. The introduced genetic material may comprise sequences 
that naturally occur in the same species but in a rearranged order or in a different 
arrangement of elements, for example an antisense sequence. Such plants are included 
herein in "transgenic plants". A "non-transgenic plant" is one which has not been 
30 genetically modified with the introduction of genetic material by recombinant DNA 
techniques. 

Several techniques exist for introducing foreign genetic material into a plant cell. 
Such techniques include acceleration of genetic material coated onto microparticles 
directly into cells (see, for example, US 4,945,050 and US 5,141,131). Plants may be 
35 transformed using Agrobacterium technology (see, for example, US 5,177,010, US 
5,104,310, US 5,004,863, US 5,159,135). Electroporation technology has also been 
used to transform plants (see, for example, WO 87/06614, US 5,472,869, 5,384,253, 
WO 92/09696 and WO 93/21335). In addition to numerous technologies for 
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transforming plants, the type of tissue which is contacted with the foreign genes may 
vary as well. Such tissue would include but would not be limited to embryogenic tissue, 
callus tissue type I and II, hypocotyl, meristem, and the like. Almost all plant tissues 
may be transformed during development and/Or differentiation using appropriate 
techniques described herein. 

A particularly preferred method of producing a transgenic cotton plant is by 
Agrobacterium-mediated transformation of cotyledons, followed by the induction of 
callus formation, and the subsequent induction of embryogenic callus, and regeneration 
into plants. 

A number of vectors suitable for stable transfection of plant cells or for the 
establishment of transgenic plants have been described in, e.g., Pouwels et al., Cloning 
Vectors: A Laboratory Manual, 1985, supp. 1987; Weissbach and Weissbach, Methods 
for Plant Molecular Biology, Academic Press, 1989; and Gelvin et al., Plant Molecular 
Biology Manual, Kluwer Academic Publishers, 1990. Typically, plant expression 
vectors include, for example, one or more cloned plant genes under the transcriptional 
control of 5' and 3* regulatory sequences and a dominant selectable marker. Such plant 
expression vectors also can contain a promoter regulatory region (e.g., a regulatory 
region controlling inducible or constitutive, environmentally- or developmentally- 
regulated, or cell- or tissue-specific expression), a transcription initiation start site, a 
ribosome binding site, an RNA processing signal, a transcription termination site, and/or 
a polyadenylation signal. 

Examples of plant promoters include, but are not limited to ribulose-1,6- 
bisphosphate carboxylase small subunit, beta-conglycinin promoter, phaseolin promoter, 
high molecular weight glutenin' (HMW-GS) promoters, starch biosynthetic gene 
promoters, ADH promoter, heat-shock promoters and tissue specific promoters. 
Promoters may also contain certain enhancer sequence elements that may improve the 
transcription efficiency. Typical enhancers include but are not limited to Adh-intron 1 
and Adh-intron 6. 

Constitutive promoters direct continuous gene expression in all cells types and at 
all times (e.g., actin, ubiquitin, CaMV 35S). Tissue specific promoters are responsible 
for gene expression in specific cell or tissue types, such as the leaves or seeds (e.g., zein, 
oleosin, napin, ACP, globulin and the like) and these promoters may also be used. 
Promoters may also be active during a certain stage of the plants' development as well 
as active in plant tissues and organs. Examples of such promoters include but are not 
limited to pollen-specific, embryo specific, corn silk specific, cotton fibre specific, root 
specific, seed endosperm specific promoters and the like. 

In a particularly preferred embodiment, the promoter directs expression around 
anthesis which is when fibre initiation and elongation occur. Thus, it is preferred that 
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the promoter is an ovule or fibre specific promoter. Examples include promoters 
described in U.S. 5,495,070, U.S. 5,608,148 and U.S. 5,602,321. 

Under certain circumstances it may be desirable to use an inducible promoter. 
An inducible promoter is responsible for expression of genes in response to a specific 
5 signal, such as: physical stimulus (heat shock genes); light (RUBP carboxylase); 
hormone (Em); metabolites; and stress. Other desirable transcription and translation 
elements that function in plants may be used. 

In addition to plant promoters, promoters from a variety of sources can be used 
efficiently in plant cells to express foreign genes. For example, promoters of bacterial 
10 origin, such as the octopine synthase promoter, the nopaline synthase promoter, the 
mannopine synthase promoter; promoters of viral origin, such as the cauliflower mosaic 
virus (35S and 1 9S) and the like may be used. 

Vectors 

*5 One embodiment of the present invention includes a recombinant vector, which 

includes at least one isolated polynucleotide molecule of the present invention, inserted 
into any vector capable of delivering the nucleic acid molecule into a host cell. Such a 
vector contains heterologous nucleic acid sequences, that is nucleic acid sequences that 
are not naturally found adjacent to nucleic acid molecules of the present invention and 

20 that preferably are derived from a species other than the species from which the nucleic 
acid molecule(s) are derived. The vector can be either RNA or DNA, either prokaryotic 
or eukaryotic, and typically is a virus or a plasmid. 

One type of recombinant vector comprises a nucleic acid molecule of the present 
invention operatively linked to an expression vector. The phrase operatively linked 

25 refers to insertion of a nucleic acid molecule into an expression vector in a manner such 
that the molecule is able to be expressed when transformed into a host cell. As used 
herein, an expression vector is a DNA or RNA vector that is capable of transforming a 
host cell and effecting expression of a specified nucleic acid molecule. Preferably, the 
expression vector is also capable of replicating within the host cell. Expression vectors 

30 can be either prokaryotic or eukaryotic, and are typically viruses or plasmids. 
Expression vectors of the present invention include any vectors that function (i.e., direct 
gene expression) in recombinant cells of the present invention, including in bacterial, 
fungal, endoparasite, arthropod, other animal, and plant cells. Preferred expression 
vectors of the present invention can direct gene expression in plant cells. 

35 In particular, expression vectors of the present invention contain regulatory 

sequences such as transcription control sequences, translation control sequences, origins 
of replication, and other regulatory sequences that are compatible with the recombinant 
cell and that control the expression of nucleic acid molecules of the present invention. 
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In particular, recombinant molecules of the present invention include transcription 
control sequences. Transcription control sequences are sequences which control the 
initiation, elongation, and termination of transcription. Particularly important 
transcription control sequences are those which control transcription initiation, such as 

5 promoter, enhancer, operator and repressor sequences. Suitable transcription control 
sequences include any transcription control sequence that can function in at least one of 
the recombinant cells of the present invention. A variety of such transcription control 
sequences are known to those skilled in the art. 

Another embodiment of the present invention includes a recombinant cell 

10 comprising a host ceil transformed with one or more recombinant molecules of the 
present invention. Transformation of a nucleic acid molecule into a cell can be 
accomplished by any method by which a nucleic acid molecule can be inserted into the 
cell. Transformation techniques include, but are not limited to, transfection, 
electroporation, microinjection, lipofection, adsorption, and protoplast fusion, A 

15 recombinant cell may remain unicellular or may grow into a tissue, organ or a 
multicellular organism. Transformed nucleic acid molecules of the present invention 
can remain extrachromosomal or can integrate into one or more sites within a 
chromosome of the transformed (i.e., recombinant) cell in such a maimer that their 
ability to be expressed is retained. 

20 

Host cells 

Suitable host cells to transform include any cell that can be transformed with a 
polynucleotide of the present invention. Host cells can be either untransformed cells or 
cells that are already transformed with at least one nucleic acid molecule (e.g., nucleic 

25 acid molecules encoding one or more proteins of the present invention). Host cells of 
the present invention either can be endogenously (i.e., naturally) capable of producing 
proteins of the present invention or can be capable of producing such proteins after 
being transformed with at least one nucleic acid molecule of the present invention. Host 
cells of the present invention can be any cell capable of producing at least one protein of 

30 the present invention, and include bacterial, fungal (including yeast), parasite, 
arthropod, animal and plant cells. Preferred host cells are plant cells, in particular 
cotton cells. In a preferred embodiment, the cells are ovule cells such as the cells of the 
outer integument of cotton ovules. 

35 Antibodies 

The invention also provides monoclonal or polyclonal antibodies to polypeptides 
of the invention or fragments thereof. Thus, the present invention further provides a 
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process for the production of monoclonal or polyclonal antibodies to polypeptides of the 
invention. 

The term "binds specifically" refers to the ability of the antibody to bind to 
proteins of the present invention but not other proteins obtained of the plant, 
5 As used herein, the term "epitope" refers to a region of a protein of the invention 

which is bound by the antibody. An epitope can be administered to an animal to 
generate antibodies against the epitope, however, antibodies of the present invention 
preferably specifically bind the epitope region in the context of the entire protein. 

If polyclonal antibodies are desired, a selected mammal (e.g., mouse, rabbit, 

10 goat, horse, etc.) is immunised with an immunogenic polypeptide. Serum from the 
immunised animal is collected and treated according to known procedures. If serum 
containing polyclonal antibodies contains antibodies to other antigens, the polyclonal 
antibodies can be purified by immunoaffinity chromatography. Techniques for 
producing and processing polyclonal antisera are known in the art. In order that such 

15 antibodies may be made, the invention also provides polypeptides of the invention or 
fragments thereof haptenised to another polypeptide for use as immunogens in animals 
or humans. 

Monoclonal antibodies directed against polypeptides of the invention can also be 
readily produced by one skilled in the art. The general methodology for making 

20 monoclonal antibodies by hybridomas is well known. Immortal antibody-producing cell 
lines can be created by cell fusion, and also by other techniques such as direct 
transformation of B lymphocytes with oncogenic DNA, or transfection with Epstein- 
Barr virus. Panels of monoclonal antibodies produced can be screened for various 
properties; i.e., for isotype and epitope affinity. 

25 An alternative technique involves screening phage display libraries where, for 

example the phage express scFv fragments on the surface of their coat with a large 
variety of complementarity determining regions (CDRs). This technique is well known 
in the art. 

For the purposes of this invention, the term "antibody", unless specified to the 
30 contrary, includes fragments of whole antibodies which retain their binding activity for 
a target antigen. Such fragments include Fv, F(ab f ) and F(ab')2 fragments, as well as 
single chain antibodies (scFv). Furthermore, the antibodies and fragments thereof may 
be humanised antibodies, for example as described in EP-A-239400. 

Antibodies of the invention may be bound to a solid support and/or packaged 
35 into kits in a suitable container along with suitable reagents, controls, instructions and 
the like. 

Preferably, antibodies of the present invention are detectably labeled. Exemplary 
detectable labels that allow for direct measurement of antibody binding include 
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radiolabels, fluorophores, dyes, magnetic beads, chemiluminescers, colloidal particles, 
and the like. Examples of labels which permit indirect measurement of binding include 
enzymes where the substrate may provide for a coloured or fluorescent product. 
Additional exemplary detectable labels include covalently bound en2ymes capable of 
5 providing a detectable product signal after addition of suitable substrate. Examples of 
suitable enzymes for use in conjugates include horseradish peroxidase, alkaline 
phosphatase, malate dehydrogenase and the like. Where not commercially available, 
such antibody-enzyme conjugates are readily produced by techniques known to those 
skilled in the art. Further exemplary detectable labels include biotin, which binds with 

10 high affinity to avidin or streptavidin; fluorochromes (e.g., phycobiliproteins, 
phycoerythrin and allophycocyanins; fluorescein and Texas red), which can be used 
with a fluorescence activated cell sorter; haptens; and the like. Preferably, the 
detectable label allows for direct measurement in a plate luminometer, e.g., biotin. Such 
labeled antibodies can be used in techniques known in the art to detect proteins of the 

15 invention. 

Assessing Fibre Properties 

Fibres produced from plants of the invention are compared to control fibres (e.g., 
fibres from wild-type plants or plants transformed with marker nucleic acids) to 

20 determine the extent of modulation of fibre properties. Modulation of fibre properties, 
such as fibre number, length, strength, or fineness, is achieved when the percent 
difference in these fibre properties of the plants of the invention and control plants is at 
least about 10%, preferably at least about 20%, most preferably at least about 30%. 

Several parameters can be measured to compare the properties or quality of 

25 fibres produced from plants of the invention to wild-type plants. These include: 1) fibre 
number, 2) fibre length; 3) fibre strength; and 4) fineness of fibres. 

A number of methods are known in the art to measure these parameters, such as 
described in U.S. 5,495,070. For example, instruments such as a fibrograph and HVI 
(high volume instrumentation) systems can be used to measure the length of fibres. The 

30 HVI systems can also be used to measure fibre strength. Fibre strength generally refers 
to the force required to break a bundle of fibres or a single fibre. In HVI testing, the 
breaking force is expressed in terms of "grams force per tex unit." This is the force 
required to break a bundle of fibres that is one tex unit in size. In addition, fineness of 
fibres can be measured, e.g., from a porous air flow test. In a porous air flow test, a 

35 weighed sample of fibres is compressed to a given volume and controlled air flow is 
passed through the sample. The resistance to the air flow is read as micronaire units. 
More specifically, the micronaire value is a measurement of cotton fibre quality that is a 
reflection of both fineness and maturity; low values indicate fine and/or immature fibre; 
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high values indicate coarse and/or mature fibres. These values are determined 
according to standard techniques by measuring the resistance offered by a plug of cotton 
to airflow (supra) that is influenced by a combination of fineness and maturity. 
Fineness is the outside diameter of the fibre that is measured in mTex (Millitex or 
5 mg/km). Maturity is the degree of wall thickening of the fibre. Short fibre count (w) % 
refers to the percentage of short fibre weight; short fibre count (n) % refers to the 
percentage of short fibre yield. Immature fibre count refers to the number of immature 
fibres, i.e., fibres in which the thickening of the fibre wall is appreciably less than 
normal. An increase in fibre yield (fibre weight/seed) can be measured by using the 
10 Advanced Fibre Information System (AFIS). Using these and other methods known in 
the art, one of skill can readily determine the extent of modulation of fibre 
characteristics, quality and/or yield plants produced by the methods of the invention. 

EXAMPLES 

15 Example 1 - Compa rison of fibre initials development in wild-type and lintless 
mutants of cotton. 

To identify genes that were specific to fibre initiation, genes were identified that 
were differentially expressed in early stage fertilised ovules of wild-type and lintless 
mutants of cotton that produce little if any fibres. The cotton lines (Gossypium hirsutum 

20 Z,.) used included two wild type cotton lines: Deltapine 16 (DP 16) and Xu-142, and 6 
lintless lines: Lintless 1A, Lintless 4A, Lintless 5B, Lintless 53, SL1-7-1 andfl. The 
1A, 4A, 5B, 53 lines were obtained from the Queensland Department of Primary 
Industry Tropical Crops and Pastures Germplasm Collection and were originally 
selections from a linted cultivar B1278 isolated by Dr Alistair Low (unpublished, 

25 CSIRO Irrigation Research, Griffith, NSW). SL1-7-1 was obtained from USDA-ARS 
(College Station, Texas, USA). The Fuzzless-lintless, fl, isolated from Xu-142 
background, and Xu-142 were provided by Prof. Xiao-Ya Chen (Institute of Plant 
Physiology, Chinese Academy of Science, Shanghai, China). All the cotton lines were 
grown in a glasshouse with temperature of 30 °C /22 °C (day/night). Ovules were 

30 always collected at a similar time each day (1-3 pm) and samples to be compared on a 
microarray were only used when they were collected on the same date and from the 
same glasshouse to minimise between time or location variability. 

The mutants were of varying provenance but it is not known whether they are 
allelic. The four lines Lintless 1 A, Lintless 4A, Lintless 5B, Lintless 53 were originally 

35 separate selections from a fully linted cultivar B1278 as spontaneous mutants that 
showed low but differing levels of lint production isolated as part of a breeding program 
to produce a cultivar with high quality seed oil and protein but with only a small amount 
of lint to retain the seeds in the boll capsule (Alistair Low, unpublished). Genetic 



38 



complementation testing has not yet been carried out. SL 1-7-1 (Mississipi Obsolete 
Collection Number 0504) was also a naturally occurring variant (Turley and Ferguson, 
1996) that produced less lint than the most lintless of the B1278 selections. All the 
mutant lines, but particularly 5B which had the most lint of any of the lines, show a 
5 variable, but low level of leakiness and this may be a result of environmental or 
physiological influences on lint production. At maturity the seeds of all of the mutant 
lines exhibit a fuzz-less phenotype (ie they have a completely naked seed and lack the 
short fuzz fibres covering the ovules of most Upland cotton varieties, including B1278). 
The growth rate and general vegetative and floral development of the mutants were 

10 similar to the wild type except that lintless 4A was slightly slimmer, and taller (about 
20%) than the Deltapine 16 (DP 16) variety used as the wildtype, whereas SL 1-7-1 (SL) 
exhibited higher levels of red anthocyanin pigments in all parts of the plant. All lines 
produced normal amounts of trichomes on their stems and leaves. 

Scanning electron microscopy was used to examine the fibre development of the 

15 5 lintless cotton mutants (Lintless 1A, 4 A, 5B, 53 and SL) at two days before anthesis (- 
2 dpa), the day of anthesis (0 dpa) and two days after anthesis (2 dpa). Cotton ovules 
were collected using the phyllotactic arrangement of cotton flowering nodes and size of 
cotton flower buds as indicator of the development stage as described by Hasenfratz et 
al., (1995). The collected ovules' were observed using an Oxford CT 1500 cryotrans 

20 system attached to a JEOL 6400 scanning electron microscope as described by Craig 
and Beaton (1996). There was generally no obvious difference between the mutants and 
the wild type (DP 16) at -2 dpa except that 4 A ovules were covered with mucous-like 
substances. The ovule surfaces were flat and epidermal cells were interspersed with 
stomata. On the day of anthesis, a few fibre initials became visible on the mutant ovules 

25 but these were considerably fewer in number than on wild type ovules. The mucous- 
like substances covering 4A ovule disappeared at this stage. The mutant phenotype was 
best revealed at two days post-anthesis when the fibres have begun to elongate. At this 
time, all of the mutants had a much reduced number of fibre initials on the surface of 
ovules and those fibres that had developed were slower to elongate and less 

30 synchronized in their elongation than the fibre initials of the wild type. The degree of 
lintlessness varied among the mutants with some lines showing very few fibre initials 
(Lintless 4A, 1A and 53) and the others more (Lintless 5B and SL-1-7-1), but still 
considerably fewer than DP 16. Early fibre growth was sensitive to environmental 
conditions demonstrated by the more rapid fibre growth of the wild type cotton grown in 

35 glasshouses with a temperature regime of 30°C/22°C (day/night) than in a glasshouse 
with temperature of 25°C/1 5°C. 
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Example 2 - Differential expression of genes in the mutant ovules compared to 
wild-type. 

cDNA Library Construction 

CHX cDNA library was constructed using cycloheximide treated ovules. Cotton 
5 flower buds of developmental stages of -3 dpa, -2 dpa, -1 dpa and 0 dpa were detached 
and pooled from glasshouse grown cotton plants and surface sterilized by dipping in 
70% ethanol and flaming twice. The cotton ovules were dissected out under sterile 
conditions and cultured on 15 ml of cotton ovule culture medium (Beasley and Ting, 
1973) supplemented with 5 \xM IAA (indole acetic acid) and 1 pM GA in 100 ml glass 

10 flasks at 29 °C in the dark overnight and then treated with 10 \xM cycloheximide for 4 
hours under the same culture conditions. After the cycloheximide treatment, the ovules 
were rinsed with sterile water, stored in RNAlater solution (Ambion) at -20 °C. Total 
RNA was isolated using a method described by Wu et al. (2002). Purification of poly A + 
mRNA from total RNA was carried out using Qiagen Oligotex mRNA kit (Cat. No. 

15 70042), following the manufacturer's protocol. In vitro translation was carried out to 
verify the bioactivity of the poly A + RNA (Wu et al. 2002). 5 ng poly A + mRNA from 
cycloheximide treated ovules was used for cDNA synthesis using a Life Technologies' 
Superscript Choice system (Cat. Series 18090) following the manufacturer's 
instructions. The first strand cDNA was synthesized using a mix of 1 \xq OHgo(dT)12- 

20 18 primer and 50 ng of random hexamers. The EcoKl adapted cDNA was size- 
fractionated and cDNA longer than 500 bp was randomly cloned in AZipLox EcoRI 
arms (Life Technology Cat No. 15397-029). This library was comprised of 2 x 106 
primary pfu with average insert size of 1 .05 kb. 

OCF cDNA library was developed from DP 16 0 dpa ovules. The total RNA 

25 isolation and poly A + mRNA purification was the same as for CHX library construction. 
5 fig poly A + mRNA was used for cDNA synthesis and cDNA library construction using 
a Life Technologies' Superscript lamda system (Cat. No. 19643-014) following the 
manufacturer's instructions. The first strand cDNA was synthesized using the NotI 
primer-adapter, in the presence of 1 |LtCi [ct- 32 P] dCTP. The Sail adapted and NotI 

30 digested cDNA was size-fractionated and the cDNA longer than 500 bp was 
directionally cloned in A,ZipLox NotI -Sail arms (Life Technology Cat No. 1 5397-029). 
This primary library was comprised of 5 x 10 5 pfu with average insert size of 0.9 kb. 

For both CHX and OCF libraries, the cDNA could be recovered in the 
autonomously-replicating plasmid pZLl using a in vivo excision protocol provided by 

35 Life Technology. Two bacterial strains (Life Technology) were used for the excision: 
DH10B (ZIP) for the preparation of double stranded plasmid DNA, and DH12S (ZIP) 
for the preparation of single stranded DNA by infection with helper phage M13K07 
(NEW ENGLAND BioLabs). 
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Normalization of the OCF library 

The OCF library was normalised using a method essentially as described by 
Bonaldo et al. (1996) with some modifications. Single-stranded plasmid DNA (ssDNA) 
was prepared from the excised OCF library using helper phage M13K07 (NEW 
ENGLAND BioLabs) and purified using the Bio-Gel HTP hydroxyapatite (HAP) 
column (Bio-Rad) according to Ali et al. (2000) with some modifications. The ssDNA 
was loaded on to a jacketed HAP column (Bio-Rad) at 60 °C and washed with 3 ml 10 
mM Na-phosphate. The column was washed with 3ml 0.1 6M Na-phosphate buffer and 
the eluate was collected in small fractions (about) 200 jul) and OD 260 was measured for 
each of the fractions. The fractions with the highest OD measurements were pooled and 
desalted using a Qiaquick PCR purification kit (Qiagen). The driver DNA was prepared 
from the same excised OCF library by Sail and NotI digestion of the double-stranded 
plasmid and the gel-separated cDNA fragments (smear on the gel) was purified using a 
Qiaquick gel extraction kit (Qiagen). Hybridization of driver with ssDNA tracer was 
performed in the presence of a 5 '-blocking oligo (5'-CCCACGCGTCCG-3') (SEQ ID 
NO:71), and a 3'-blocking oligo (5'-AAAAAAGGGCGGCC-3 ') (SEQ ID NO:72). The 
hybridization mix comprised 0.5 M NaCl, 10 mM Tris (pH 8.0), 1 mM EDTA, 2 ug 
double-stranded driver DNA, 0.2 ug ssDNA, 10 jag of each of the blocking oligos, and 
25 ul formamide in a total volume of 50 ul. The driver DNA was heat denatured by 
boiling for 5 min prior to mixing with the rest of the components and the whole 
hybridisation mix was heated to 65°C for 3 min and then incubated at 42 °C for 24 
hours. The hybridisation mix was run through a HAP column under the same conditions 
as described for ssDNA purification to separate the ssDNA from the hybridised DNA. 
The conversion of purified ssDNA into double-stranded plasmid DNA and 
transformation of competent DH10B (Life Technology) was as described by Ali et al. 
(2000). The normalized library was designated the ON library consisting 10 s primary 
transformants with average insert size of 0.9 kb. 

PCR amplification ofcDNA clones and microarray preparation 

The cotton ovule cDNA microarray comprises a total of 10410 cDNA clones. 
Except for 52 clones encoding known cotton genes provided by colleagues, and 13 
clones of negative controls (non-plant genes, intron sequences etc.), the rest of the 
clones were randomly picked from the CHX, OCF and ON cDNA libraries, including 
5496 clones from CHX library, 1 149 clones from OCF library and 3700 clones from the 
ON library. 

All the anonymous cDNA clones from the ovule cDNA libraries were PCR 
amplified in 96 well PCR plates (AB gene). The PCR reaction contained of 2 mM 
MgCl 2 , 0.2 mM each of the dNTPs, 0.2 uM each of the M13/pUC forward and reverse 
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primers, 1 unit Taq F2 DNA polymerase and lx F2 buffer (BIOTECH International 
Limited) in a 50 pi reaction with 2 pi overnight cultured bacterial cells as template. A 
row of 12 samples from each PGR plate was verified by agarose gel electrophoresis. 
The PGR fragments were ethanol precipitated and resuspended in 10 pi 50% DMSO and 
5 8 pi of the fragments was transferred to 384 well plates for microarray slide printing. 
The PGR fragments were arrayed onto CMT-GAPS coated microarray slides (Corning) 
using a Virtek ChipWriter Pro (Virtek Biotech) arrayer. Post-printing slide processing 
was performed by baking the slides at 80 °C for 3 hours as described in the 
manufacturer's technical manual. - 

10 

Micoarray analysis 

Cotton ovules used for RNA isolation were kept in RNAlater solution (Ambion) 
at 4°C overnight and then stored at -20°C. For separating the ovule outer integument 
from the inner tissues, ovules stored in RNAlater were used and the separation was 

15 performed under a microscope at room temperature. Total RNA isolations were 
performed using a method described by Wu et aL, (2002). Purification of poly A + 
mRNA from total RNA was carried out using Oligotex mRNA kit (Qiagen, Cat. No. 
70042), following the manufacturer's protocol. The cotton ovule cDNA microarray 
comprises a total of 10410 cDNA clones. Except for 52 clones encoding known cotton 

20 genes, the clones were randomly picked from cDNA libraries constructed from DP- 16 
ovules of -3 dpa to 0 dpa. 

For microarray probe labelling, equal amounts of mRNA (0.5-1 pg) of two 
compared samples were reverse transcribed using Superscript II reverse transcriptase 
(Life Technologies), using a combination of 1 jag oligo(dT)12-18 primer and 6 |ag 

25 random primers (Life Technologies) per reaction. The purification and Cy3-dUTP and 
Cy5-dUTP (Amersham Pharmacia Biotech) labelling of the first strand cDNA was 
essentially as described by Schenk et al., (2000). The labelled probes were combined 
and purified using a Qiaquick PGR purification kit. The conditions for slide 
hybridisation and washing were as described in the manufacturer's instruction manual 

30 (Corning, CMT-GAPS coated slides). The microarray images were scanned using a 
GenePix 4000A microarray scanner (Axon Instruments, Union CA, USA). A typical 
microarray comparison consisted of 4 replicates unless otherwise specified. This 
included two biological replicates and each biological replication contained two dye- 
swapped hybridisations. In a time course comparison where RNA from DP 16 ovules of 

35 -4 dpa, -2 dpa and 4-2 dpa' were compared to RNA from 0 dpa DP 16 ovules, most 
comparisons consisted of 4 replications as mentioned above, however, self-comparison 
of 0 dpa to 0 dpa comprised 3 biological replications. 
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Scanned microarray images were analysed using the GenePix Pro program 
(Axon Instruments, Union CA, USA). Grids were predefined and manually adjusted to 
ensure optimal spot recognition and bad spots, eg. dust contamination etc., were 
flagged. Spots were quantified using the GenePix's fixed circle method, and medians of 
5 the fluorescence intensity of the red and green channels were used to calculate the ratio 
of the two channels. The data were log 2 transformed and normalised! using a spatial 
normalization method described by Wilson et ah, (2003). The data were then rescaled 
by dividing by an estimate of the median absolute deviation (Wilson et al., 2003) before 
running the "find differentially expressed gene" function of tRMA (tools for R 

10 Microarray Analysis available via http://www.pi.csiro.au/gena/). For a typical 
microarray comparison that consisted of 4 replications, the "find differentially expressed 
gene" function of tRMA was used to select differentially expressed genes from each of 
the replications separately, and the gene lists were then compared and genes occurring 
in at least 3 of 4 replications were classified as differentially expressed genes for this 

15 comparison. 

Gene expression in 0 dpa whole ovules of each of the mutants was compared to 
the wild-type DP 16 at the same stage using microarray analysis as described above to 
identify differentially expressed genes. The number of genes identified are shown in 
Table 2. Each experiment was replicated a number of times as both biological replicates 

20 and dye swap replicates. The total number of cDNA clones that are differentially 
expressed amongst the mutants compared to the wild-type varied significantly, from an 
average of 60 clones in the Lintless 4A/DP16 comparison up to an average of 243 
clones in the Lintless 53 /DP 16 comparison. In addition, the proportion of cDNA clones 
that are up or down-regulated also varied amongst the mutants with 4 mutant lines (1 A, 

25 53, 5B and SL) showing a higher number of cDNA clones that were up-regulated than 
down-regulated, and vise versa for the mutant Lintless 4A. These results may reflect the 
diverse genetic backgrounds or different genetic lesions of the mutants in addition to the 
variable amounts of lint produced by each line. 

30 TABLE 2. Number of cDNA clones that are up- or down-regulated in each of the 
mutants as compared to the wild type 



Comparison 


Genes 

Differentially 
Expressed 


Genes 
Up in 
Mutant 


Genes 
Down in 
Mutant 


Number of 
Replicates* 


Minimal 

Reproducibility* * 


4A/DP16 


60 


9(15%) 


51 (85%) 


8 


75% 


1A7DP16 


67, 


48 (72%) 


19 (28%) 


8 


75% 


SL/DP16 


102 


88 (86%) 


14 (14%) 


6 


83% 


5B/DP16 


144 


91 (63%) 


53 (37%) 


4 


75% 


53/DP16 


243 


199 (82%) 


44 (18) 


4 


75% 
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* Replicates consist both biological and technical replications. Each biological replicate 
(a RNA isolation) comprises two dye-swaped technical replicates. Number of 
biological replicates = Total number of replicates/2. 

** Genes identified as being significantly differentially expressed in at least 6 out of 8 
5 or 3 out of 4 replicates (75%), or 5 out of 6 replicates (83%). 

Example 3 - Identification of genes that are differentially expressed in seed coat 
outer integument of lintless mutants. 

Since the collected embryos described above may already have been pollinated 

10 and zygote development initiated, a separate microarray comparison was made between 
the mRNAs of the outer integument and those of the inner ovule tissues of the wild-type 
cotton, to filter out those genes that were not expressed specifically in the seed coat 
outer integument where fibres are initiated. Cotton ovules at 0 dpa are rapidly 
developing complex organs, composed of at least three separable layers of tissues: the 

15 outer integument, the inner integument and the nucellus (including a developing 
zygote). The genes identified as being differentially expressed in the mutant/wild-type 
comparisons might be constitutively expressed throughout the whole ovule or they may 
be expressed in only one or two of the layers. As cotton fibres develop only from the 
epidermal cells of the outer integument, genes that showed a higher expression level in 

20 this layer should be more relevant to fibre initiation and development compared to the 
genes that are predominantly expressed in the inner integument and nucellus. Outer 
integuments were therefore separated from the inner integuments and nucellus of 0 dpa 
wild type ovules by microdissection and labeled cDNA prepared from the partitioned 
tissues as described above. The gene expression in the outer integument was then 

25 compared to that of the inner integument and nucellus by probing the ovule cDNA 
microarray. The results, averaged over four replicates (two biological replicates each 
consisting of two dye-swapped technical replicates) revealed a total of 120 cDNA 
clones that were differentially expressed with 65 clones up-regulated and 55 clones 
down-regulated in the outer integument of wild type ovules. The list of 65 outer 

30 integument up-regulated clones was then used as a filter on the differentially expressed 
gene lists identified from the lintless mutant/wild type comparisons to select for cDNA 
clones that were up-regulated in the outer integument. This filtering resulted in the 
identification of a surprisingly small number of genes: 4, 7, 6, 10, 4 genes from the 1A, 
4A, 5B, 53 and SL mutants respectively that were both differentially expressed in 

35 mutant/wild type comparisons and up-regulated in the outer integument. 

There was significant overlap of the cDNA clones amongst the different mutants 
and in total only 11 unique cDNA clones were identified from this experiment as 
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potential candidate genes involved in early stage fibre development. The changes in 
relative expression in each mutant are summarized in Table 3. 



TABLE 3. Genes that were up- or down-regulated in the lintless mutants and up- 
5 regulated in the outer integument 3 



Name 


Clone No 


SL/ 
DP 


1A/ 
DP 


53/ 
DP 


4A/ 
DP 


5B/ 
DP 


OI/I 
I c 


Most Homologous 
gene d 


GhMyb25 


ON035F4 


-5.8 


-10.1 


10.8 


-10.8 


10.8 


4.6 


(AF336283) 
GHMYB25 


GhFaEl 


ON035N 
9 


-5.6 


-6.7 


-5.7 


-7 


-5.8 


4.7 


(NP_1 95909) 
Transferase 


GhFUl 


ON035C 
9 


-8.5 


-11 


14.1 


-15.2 


13.8 


2.7 


Unknown 


<x- 

Expansin 


Pfsl4x 


-6.8 


-7.6 


-9-1 


/ 


-9.1 


3.3 


(AF5 12539) 
Alpha-expansin 


GhFU2 


ON003F1 


/ 


/ 


-5.9 


-5.6 


-8.2 


2.8 


Unknown 


GhHDl 


ON033M 
7 


/ 


/ 


/ 


-5.5 


-4.9 


2.8 


(T05850) 

Homeobox protein 
ATML1 


GhTMTP 


CHX015 

K18 . 


/ 


/ 


/ 


-8.1 


-4.4 


4.7 


(NP_175557) ATP- 
dependent 
transmembrane 
transporter 


GhCycD3 

;i 


OCF07F4 


/ 


/ 


/ 


-5.3 


-3.7 


4.9 


(AAQ19972) 
Cyclin D3 


CHX007 
D10 b 


CHX007 
D10 


/ 


/ 


3.2 


/ 


/ 


4.4 


(AC084282) 
Putative protein 
phosphotase/(BAB 
83948) CIG1 


GhSus 


CHX002 
CIO 


/ 


/ 


/ 


/ 


r 3.5 


4.6 


(AAD28641) 
Sucrose synthase 


GhLTP 


ON033M 
19 


/ 


7 


/ 


/ 


-5.9 


3.2 


(AAM62634) Lipid 
transfer protein 



a The values presented in the table are the medians of Log2 transformed, normalized and 
rescaled ratios of the two compared samples. The rescaling was performed by dividing 
10 through by an estimate of the median absolute deviation (Wilson et al., 2003). 
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b Most probably a chimeric clone. 

c OI: Outer Integument; II: Inner Integument and Nucellus. 

d Most homologous gene based on the top BlastX identity score, with Genbank 
Accession No, and putative biochemical function. 

5 

Except for clone CHX007D10, which appears to be a chimeric clone, that was 
up-regulated in mutant Lintless 53, the rest of the clones were all down-regulated in the 
mutants. Three genes (corresponding to clones ON035F4, ON03SN9, ON035C9) were 
down-regulated in all 5 mutants and the other clones are down-regulated in up to 4 of 

10 the mutants. Sequence analysis showed a range of genes that had not been identified or 
characterized previously as important in early fibre development, including two 
transcription factors (GhMyb25 and a gene encoding a putative homeodomain protein); 
a cyclin D3 homolog; a transferase protein; a transmembrane transporter and two genes 
of unknown biochemical function. For ease of referring to the different genes, we have 

15 assigned them gene names that refers to their presumed functions, such as GhHDl to 
refer to the cotton homeodomain protein like gene represented by ON035N9 (Table 3). 

Example 4 - Expression patterns of genes in wild-type developing ovules. 

RNA from DP 16 ovules of -4 dpa, -2 dpa and +2 dpa was compared to RNA 

20 from 0 dpa DP 16 ovules using microarrays to profile the temporal changes in expression 
of genes around the time of fibre initiation (DPI 6 time course). RNA from -2 dpa and 
+2 dpa ovules from the Lintless 4A mutant, which shows more severe lintless 
phenotypes amongst the B1278 mutants, was compared with RNA from DP ovules of 
corresponding stages to reveal the temporal profiles of the genes inhibited in mutant 4A 

25 (4A/DP multi-time point comparison). The results of these experiments for the 
identified candidate genes are shown in Fig. 1 . 

The DP 16 time course showed two classes of expression profiles: Class I genes 
showed peak expression at 0 dpa; while Class II genes exhibited increased expression 
towards +2 dpa (Fig. 1. Column A). Three genes, GhMyb25, the GhHDl and 

30 GhCycD3;l, had a Class I expression profile suggesting a role in the early events of 
fibre initiation at anthesis. The expression of GhCycD3;l increased continually from -4 
dpa to 0 dpa and plateaued between 0 dpa and +2 dpa, while GhMyb25 and the GhHDl 
exhibit a dip in expression at -2 dpa followed by a peak at 0 dpa and then a decline 
towards +2 dpa. The peak expression at 0 dpa of these three genes coincided with the 

35 time of fibre initiation. The rest of the genes all showed a Class II expression pattern 
although the specific details differed among them. The expression of GhFUl and 
GhSus (sucrose synthase) increased gradually in the time period examined. GhFaEl 
(transferase family) and the expression of GhTMTP (transmembrane transporter) 



46 



showed a slight decrease from -4 dpa to -2 dpa and then increased gradually towards +2 
dpa. The third group from this, class comprised cc-Expansin and GhLTP (lipid transfer 
protein) which show a distinctly flat profile from -4 dpa to 0 dpa followed by a sharp 
increase from 0 dpa to 4-2 dpa. The last member from this class, GhFU2, exhibited 
5 increased expression from -4 dpa to -2 dpa and again from 0 dpa to +2 dpa, while the 
expression between -2 dpa to 0 dpa remained unchanged. 

The multiple time point comparisons of 4A/DP16 revealed the time and duration 
of up- or down-regulation of the genes in the 4A mutant relative to DP 16 (Fig. 1. 
Column B). Among the Class I genes, none showed significant repression relative to 

10 DP 1 6 at -2 dpa in 4 A ovules, instead, the repression started after -2 dpa and reached the 
lowest level at 0 dpa for the GhHDl and GhCycD3;l genes, while GhMyb25 continued 
to decrease slightly after 0 dpa. Class II genes, in a similar fashion, did not show any 
significant repression at -2 dpa, and repression started after -2 dpa for most of the genes 
except GhSus, which only showed a later repression after 0 dpa. All the Class II genes 

15 exhibited repression at about +2 dpa. Three Class II genes, GhSus, a-Expansin and 
GhLTP, were not identified as differentially expressed genes in the initial 0 dpa 4 A/DP 
comparisons. It became clear from this experiment that the repression of GhSus did not 
occur until after 0 dpa and only became significant at +2 dpa, while the repression of cc- 
Expansin and GhLTP began after -2 dpa only became highly significant towards +2 dpa 

20 when fibres were rapidly elongating in the wild-type. 

Example 5 - Confirmation of down-regulation of fibre initiation genes. 

The genetic background of the fibre mutants (B1278 and SL) used herein was not 
identical to that of the DP 16 control used in the comparisons. Neither parental genotype 

25 was available for use in the experiments described above. While the differential 
expression of genes observed between the mutants and the wild-type might have arisen 
due to differences in the genetic, backgrounds of the plants, the commonality of the 
genes identified among the different mutants and the fact that some have been 
characterized previously as important for fibre development using different strategies, 

30 suggested otherwise. A fuzzless-lintless (JT) mutant had recently been isolated from the 
Chinese G. hirsutum cultivar Xu-142 and used to identify and characterize fibre 
development related genes (Yu et al., 2000, Li et al, 2002, Ji et al., 2003). These two 
lines provided an isogenic pair with which to validate the genes identified from the other 
mutants. RNA from 0 dpa ovules of fl was compared to that of 0 dpa ovules of Xu-142 

35 and 119 clones were identified as differentially expressed in four replicates. The same 
outer/inner integument gene expression filter as described above was applied to the data 
set and identified 13 differentially expressed genes that were also up-regulated in the 
outer integument of DP 16 ovules. Amongst the 13 genes, 8 were in common with genes 
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identified in the other lintless mutant/DP16 comparisons and the results are presented in 
Table 4. This comparison also revealed 5 additional genes that had not been identified 
in the previous 5 mutants. One of the cDNA clones, ON038N8 (886 bp), encodes a Myb 
protein which was 69% identical to the GhMyb25 protein. Other genes include two 
5 different RD22 genes, a second LTP and a putative L-asparaginase. 



TABLE 4. Genes differentially expressed in fl as compared to Xu-142 





f * 1 j-\ r\ /-v TvT^-v 

vyione jno 


tl/Xu-142 


Most Homologous gene*** 


W# -L~rA.JJcll.io 111 


r IS 1 ^fX 


-11.5 


(AF5 12539) Alpha-expansin 


GhFUl* 


ON035C9 


-9.8 


Unknown 


\jriuviy dzd 


rWTAl CT? A 

UJN035r4 


-9.0 


(AF336283) GHMYB25 


GhFU2* 


ON003F1 


-7 R 


uiiiuiown 


GhFaEl* 


ON035N9 


-7.4 


(NP 195909) Transferase 


GhHDl* 


ON033M7 


-6.1 


(T05850) Homeobox protein ATML1 


GhLTP* 


ON033M19 


-4.9 


(AAM62634) Lipid transfer protein 


GhTMTP* 


CHX015K18 


-3.7 


(NP_1 75557) ATP-dependent 
transmembrane transporter 


GhMyb25-like 


ON038N8 


-4.4 . 


(AF336283) GHMYB25 


GhRD22 


OCF005C10 


-4.4 


(AAL67991). Dehydration-induced 
protein RD22 


GhAsp 


OCF008G9 


-3,4 


(BAC66615) L-aspraginase 


GhLTP2 


OCF010D8 


3.4 . 


(CAA65477) Non-specific lipid-transfer 
protein 


GhRD22-like 


OCF006C1 


5.3 


(BAC22498) Resistant specific protein- 1 



1 0 * Genes in common with the candidate genes from the other 5 mutants. 

** The values presented in the table are the medians of Log 2 transformed, normalized 
and rescaled ratios of the two compared samples. The rescaling was performed by 
dividing through by an estimate of the median absolute deviation (computed on the final 
residual mean-difference data) as described by Wilson et al., (2003). 

15 *** Based on the top BlastX hit. 

Example 6 - Ch aracterisation of GhMvb25 exp ression. 

The nucleotide sequence of the clone ON035F4 was obtained. It was 1160 
nucleotides in length with a coding region from nucleotides 68 to 995, encoding a 
20 protein which was 98% identical at the amino acid level to GhMyb25 (AF336283), 
expressed in 0 dpa ovules of G. hirsutum cultivar Acala Maxxa (Benjamin Burr, in 
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Genbank). The encoded protein was an.R2R3 type of Myb transcription factor. It was 
also 96% identical to the G. arboretum EST (BE054276), suggesting that it was from 
the A-genome present in tetraploid cotton. Outside the R2R3 region, which is highly 
conserved amongst all Myb transcription factors, GhMyb25 showed highest homology 
5 to the Petunia hybrida MYB.Ph3 and Antirrhinum majus MIXTA (AmMIXTA) than to 
Arabidopsis GL1 and cotton MYBA, another cotton myb which caused distinct 
abnormalities when over expressed in transgenic tobacco including the production of 
cotyledonary trichomes (Payne et al., 1999). The sequence of GhMyb25 currently in 
Genbank (AF336283) contained an unspliced intron that was not present in our clone 

10 (nucleotides 201-282 of AF336283). 

Reverse transcription-polymerase chain reaction (RT-PCR) assays were used to 
analyse expression of the gene as follows. Total RNA samples isolated from cotton 
tissues were DNase (RQ1 RNase-free DNase, Promega) treated and 0.5 |ag of the total 
RNA was used in a RT-PCR reaction, The first strand cDNA synthesis was performed 

15 using Superscript II reverse transcriptase and buffer supplied by the manufacturer (Life 
Technologies). The RT-PCR reaction was performed essentially as described by 
McFadden et al., (2001). The cotton p-tubulin gene was used as a control in all the RT- 
PCR reactions. The forward and reverse primers used for p-tubulin are 5'- 
AG AACATGATGTGTGCTGC-3 ' (SEQ ID NO:65) and 5'- 

20 AGCTGTG AACTGCTC ACTC-3 • (SEQ ID NO:66) respectively and the resulting 
cDNA fragment was 300 bp. The forward and reverse primers used for GhMyb25 RT- 
PCR were: 5'-TCAAACCCTCCTCAAAGCAACC-3' (SEQ ID NO:67) and 5'- 
ATTCCATTACCAGACGATGATGAC-3 ' (SEQ ID NO:68) respectively and this 
produced a cDNA fragment of 224bp. The GhMyb25 and P-tubulin RT-PCR reactions 

25 were performed in a one-tube reaction amplified with an initial denaturation cycle at 
95°C for 3 min followed by 23 cycles at 95°C for 15 sec, 55 °C for 15 sec, 72 °C for 1 
min and with a final cycle of 72°C for 2 min. 5 |nl of the RT-PCR reaction was checked 
on a 2% agarose gel and the gel was Southern-blotted to Hybond-N* membrane 
(Amersham Pharmacia Biotech) and hybridized with 32 P- labelled probe derived from 

30 the cDNA clone of GhMyb25. ' 

GhMyb25 expression was detected in 0 dpa wild-type ovules and not in petal, 
leaf and stem using RT-PCR (Fig. 2, panel a). GhMyb25 expression was detected in -2, 
0, 2 and 5 dpa ovules with highest expression in 0 and 2 dpa ovules (Fig. 2, panel b). 
No expression of GhMyb25 was visible in ovules of -2, 0 and 2 dpa from three of the 

35 lintless mutants, 1A, SL1-7-1 and 4A (Fig. 2, panel c) although very low expression in 
the mutant ovules was revealed after hybridizing RT-PCR products with an ON035F4 
probe (Fig. 2, panel d), consistent with the low levels of lint production that still occurs 
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on these ovules. GhMyb25 expression peaked at 0 dpa in lines 1A and SL1-7-1, whereas 
in 4A, the highest expression was detected slightly later at 2 dpa. 

Example 7 - Characterisation of the homeodomain protein gene GhHDl. 

5 ON033M7, a partial cDNA clone of 442 nucleotides, contained a small region at 

the 3 'end of its encoded protein with homology to two homeodomain proteins including 
protodermal factor 2 (NP_567274) (26/28 identical amino acids) and the LI -specific 
and ovule specific homeodomain gene ATML1 (T05850) (26/30, 86%). ATML1 has 
been grouped with- the Arabidopsis GLABRA2 in the same HD-GL2 class and they also 
10 share a common LI layer-specific or dermal-specific pattern of expression (Lu et al>, 
1996). The cotton gene was designated GhHDl. 

RT-PCR experiments were carried out to analyse expression, in similar fashion 
to those described above. The forward and reverse primers used for the GhHDl RT- 
PCR were; 

15 5'-GCTTTCTCTTGGATCAG-3' (SEQ ID NO:69) and 5'- 
CAATAACAC ATGAAACCAG-3 ' (SEQ ID NO:70) respectively and these resulted in 
a cDNA fragment of 384bp. The GhHDl and p-tubulin RT-PCR reactions were 
performed separately under the conditions described above. 10 jil of the RT-PCR 
reaction was electrophoresed on a 2% agarose gel and the gel was Southern-blotted to 

20 Hybond-N* membrane (Amersham Pharmacia Biotech) and hybridized with 32 P- 
labelled probe derived from the cDNA clone of the putative homeodomain gene. Since 
the expression of the p-tubulin appeared to be variable in different cotton tissues, the 
quantification of the GhHDl expression using p-tubulin gene as a standard was only 
performed on the 0 dpa ovules of different cotton lines using an Image-Quant program 

25 (Molecular Dynamics). 

GhHDl was mainly expressed in ovules of various developmental stages and at 
much lower levels in leaves as revealed by RT-PCR (Fig. 3). The expression increased 
at -1 dpa and remained high till 2 dpa. Low expression levels were observed in 4 dpa 
ovules (with fibres attached), 8 dpa ovules (without fibres) and 8 dpa detached fibres. 

30 The expression levels of 0 dpa ovules of mutants 5B and 4A relative to DPI 6 0 dpa 
ovules after normalization with p-tubulin expression is shown in Fig. 3. The expression 
in the mutant ovules was slightly reduced in 5B and remained similar to wild-type in 4A 
ovules. The RT-PCR band was confirmed as GhHDl -specific by Southern blot 
hybridization using the ON033M7 cDNA fragment as probe. 

35 

Example 8 - Characterisation of other genes. 

The characteristics of the genes identified in this study are summarized in Tables 
5 and 6. 
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Clone OCF07F4 encoded a cyclin D3, similar to AAQ19972, 106/149 similar 
amino acids, 71%). 

Example 9 - Fibre cells undergo DNA cndoreduplicatiou during initiation. 

5 The observation that a cyclin D3 gene encoded by clone OCF007F4 was down- 

regulated in 0 dpa ovules of mutants 4 A and 5B compared to DP 16 prompted the 
inventors to investigate the cell division and DNA replication activities of the 
epidermal layers of DP 16 and lintless 4 A ovules. Ovules of cotton line DP 16 and 4 A at 
-2, -l ? and 0 dpa stages were fixed in 3:1 (95% ethanol: acetic acid) for lhr at room 

10 temperature, cleared in 95%/lmM MgCl 2 ethanol over night at room temperature and 
rehydrated through an ethanol series to lOmMTris/lmM MgCl 2 according to 
Szymanski and Marks (1998). The ovules were stained in O.lfig/ml propidium iodide 
for 30 seconds and than distained and kept in lOmMTris/lmM MgCl 2 , 

Nuclear DNA content of ovule epidermal and fibre cells at the chalazal end were 

15 measured using a Leica SP2 *confocal laser scanning microscope (Leica, Wetzler, 
Germany). At least 200 nuclei were measured from each sample which consisted of at 
least 3 ovules. Fluorescence at 600-740 nm was collected after excitation at 488 and 
543 nm using a 63 X NA 1.25 water-immersion lens. After optically sectioning 
through the ovule epidermis, the mean fluorescence intensity and dimensions of 

20 epidermal and fibre cell nuclei was measured from the maximum projection of the 
optical stack. Total fluorescence of individual nuclei was calculated by multiplying 
nuclear area by average fluorescence. This value was converted to a ratio by 
normalising against total fluorescence of epidermal cell nuclei at telophase or anaphase 
(2C) within the same image. The normalized values were then used to construct 

25 histograms of epidermal and fibre cell nuclear DNA content. 

Since fibre cells are known to cease dividing after differentiation, the ovule 
epidermal cell division activities of DP 16 and 4A-183 were examined at and before 
anthesis. Over the period examined (-2, -1 and 0 dpa), the extent of cell division in the 
ovule epidermis of lintless 4A and DP 16 were not significantly different (paired West, 

30 P = 2.1%) although the division rate in 4A was slightly higher than in DP16 (Table 7). 
It appears that the ovule epidermis division rates are higher at -1 dpa for both DP 16 
and 4A, although the significance of this is unclear. 
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TABLE 7. Cell division rates in ovule epidermal cells of DP 16 and mutant 4A 



DPA 


DPI 6 


4A 


-2 dpa 


2.78% 


2.97% 


-1 dpa 


3.25% 


3.5% 


Odpa 


2.69% 


2.84% 5 



Relative DNA contents of the epidermal cells and fibre cells were also 
measured, and nonnalized using DNA contents of nuclei at anaphase or telophase (2C) 
and the results are presented in Fig. 4. Since the pre-fibre initials and epidermal cell 

10 are visually indistinguishable at -2 and -1 dpa, the data for those times are presented as 
total epidermal cells. The results indicate that epidermal pavement cells of DP 16 and 
Lintless 4A at these time points have a DNA content peak around 2 to 2.4C, while 
differentiated fibre initials of DP 16 when clearly distinguishable at 0 dpa, have an 
increased DNA content with the majority of cells showing a DNA content between 2.8 

1 5 C and 5.2C. While this result clearly suggests that the majority of fibre cells undergo at 
least one round of DNA endoreduplication during initiation, the involvement of the 
cyclin D3 gene in this process still needs to be verified. 

Example 10 - Cloning of full-length cPNA sequences and genes encoding therefor 

20 At least two approaches can be used to determine the full length sequence of 

partial cDNA clones described herein. 

One method is to screen a cDNA library, such as the cotton DP- 16 ovule -3 dpa 

to 0 dpa library described herein, with a radioactively labelled polynucleotide which 

comprises the known portion of the cDNA. Library screening is performed as 
25 described by Sambrook et al., (supra), or other techniques known to those of skill in the 

art. 

In another method, two primers of about 17 to about 20 nucleotides derived 
from both ends of the known partial sequence are synthesized and used to amplify the 
desired cDNA from a population of cDNA reverse transcribed with a poly-T 

30 comprising primer from mRNA obtained from, for example, cotton DP-16 ovule -3 dpa 
to 0 dpa. The polymerase chain reaction (PCR) is carried out under routine conditions, 
for instance, in 25 \xl of reaction mixture with 0.5 \xg of the above cDNA mixture. A 
convenient reaction mixture is 1.5-5 mM MgCl 2 , 0.01% (w/v) gelatin, 20 fxM each of 
dATP, dCTP, dGTP, dTTP, 25 pmol of each primer and 0.25 Unit of Taq polymerase. 

35 Thirty five cycles of PCR (denaturation at 94°C for 1 min; annealing at 55°C for 1 min; 
elongation at 72°C for 1 min) are performed with a Perkin-Elmer Cetus automated 
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thermal cycler. The amplified product is analyzed by agarose gel electrophoresis and 
the DNA band with expected molecular weight (as determined by Northen blot 
analsysi) is excised and purified. The PCR product is verified to be the selected 
sequence by subcloning and sequencing the DNA product. 
5 Several other methods are available for the identification of the 5' or 3 ! ends of 

an mRNA sequence. These methods include but are not limited to, filter probing, clone 
enrichment using specific probes, and protocols similar or identical to 5 f and 3' 
"RACE" protocols which are well known in the art. For instance, a method similar to 
5' RACE is available for generating the missing 5' end of a desired full-length 

10 transcript. Briefly, a specific RNA oligonucleotide is ligated to the 5* ends of a 
population of RNA presumably containing full-length gene RNA transcripts. A primer 
set containing a primer specific to the ligated RNA oligonucleotide and a primer 
specific to a known sequence of the gene of interest is used to PCR amplify the 5' 
portion of the desired full-length gene. This amplified product may then be sequenced 

1 5 and used to generate the full length cDNA. 

To clone the corresponding gene of a cDNA described herein, a cotton genomic 
library is made in a A, vector and plaques obtained by plating 10 6 or more X infectious 
particles at high density in a suitable E. coli host, The plaques are transferred to nylon 
filters. A gene specific probe designed considering the polynucleotides provided herein 

20 is labelled with radioactive label and used to hybridise to the nylon filters. Plaques 
corresponding to spots of hybridisation are isolated and confirmed to be positive for 
desired sequence by second or third rounds of hybridisation. DNA sequencing of the 
gene segments in the X clones is carried out by standard methods to determine the full 
nucleotide sequence of the gene and the flanking upstream and downstream regions. 

25 

Example 11 - Production of transgenic cotton 

The coding sequence of a gene of the invention is operably linked to a 
subterranean clover stunt virus promoter (S7; WO 96/06932) and a 3' transcription 
termination and polyadenylation signal functional in plants. This chimeric gene is 
30 operably linked to a selectable marker gene and introduced into a T-DNA vector. 
Cotton plants are transformed using the Agrobacterium mediated transformation 
technique. Transgenic cotton lines are identified, fibre number, fibre length, fuzz fibre 
length, cellulose content, and dry weight of the lint is analyzed. 
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Example 12 - Discussion 

The use of multiple mutant lines in the above mentioned expression studies 
enabled confirmation and complementation of the findings from one mutant to another 
and focus in on the most critical genes for fibre development. It also helps to smooth 
5 out "noise" contributed by the biological variability in fibre growth and the unknown 
and maybe diverse genetic backgrounds of some of the mutants. The comparison of 
gene expression profiles between the outer integument and the inner ovule tissues 
served as a filter, to eliminate those genes that are not expressed at higher levels in the 
outer integument and helped to focus on a small set of about 10 candidate genes. The 

10 wild type time course data reveal that the expression profiles of these candidate genes 
separated them into two classes with class I genes (GhMyb25, GhHDl and 
GhCycD3;l) showing peak expression at 0 dpa, coinciding with the time of fibre 
initiation; class II genes exhibiting increased expression at 2 dpa, suggesting a more 
important role in fibre elongation. 

15 By analogy with the regulatory genes involved in Arabidopsis leaf trichome 

development, it might have been expected to find a Myb transcription factor expressed 
in cotton fibre that was similar to GLL GhMyb25 (and the GhMyb25-like gene), 
however, shows higher sequence similarity to the Petunia hybrida Myb,Ph3 and the 
Antirrhinum MIXTA than to GL1 or other cotton Mybs. Both Myb.Ph3 and MIXTA 

20 show petal epidermis-specific expression. Based on its expression pattern, it was 
speculated that the function of Myb.Ph3 was to regulate flavonoid biosynthesis (Solano 
et al., 1995), but this speculation has not been confirmed. The function of MIXTA has 
been revealed as a controlling factor for the conical shape of petal epidermal cells 
(Noda et al,, 1994). Over-expression of MIXTA in transgenic tobacco lead to 

25 production of supernumerary trichomes on cotyledons, leaves and stems as well as 
novel production of conical cells on leaves (Payne et al., 1999). In contrast, GhMyb25 
is expressed only in the ovules and not in later stage fibres (or petals so is not a 
homolog of MIXTA). It had higher expression in outer integuments and the time of 
expression coincide with fibre initiation. Accordingly, GhMyb25 plays a role in fibre 

30 initiation. The fact that GhMyb25 is down-regulated in all the lintless mutants, 
including the new fl mutant, points to a role as a positive regulator of fibre initiation. 

Comparison of the fl mutant with its parental genotype identified in addition to 
the 8 genes common to other mutants, a second Myb transcription factor, a GhMyb25~ 
like gene, containing a conserved region outside the R2R3 domain shared by all the 

35 MIXTA class of Mybs (Stracke et al., 2001). The GhMyb25-like gene is only 64% 
identical to probable A-genome derived GhMyb25 at the nucleotide level (69% 
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similarity at the amino acid level), suggesting it is unlikely to be the homoelogous D- 
genome partner of GhMyb25 present in tetraploid cotton. GhMyb25-like is expressed 
at low level in -4, ~2 dpa ovules of DP 16 (0.3 relative to 0 dpa ovule of 1) and increased 
sharply to 1 at 0 dpa and remained at a similar level of 1 at 2 dpa. This expression 
5 profile indicates a role for GhMyb25-like in fibre initiation. Among the 5 additional 
genes identified in this analysis, there were two RD22 genes, consistent with the 
findings of Li et al., (20.02) who identified a RD22 gene showing fibre specific 
expression using the same lines. 

The putative homeodomain gene identified in this study has high similarity in 

10 part of its C-terminus to the LI specific and ovule specific homeodomain gene 
ATML1. ATML1 was classified in the same HD-GL2 class as Arabidopsis GL2 based 
on sequence homology and they, share a common LI layer-specific or dermal-specific 
pattern of expression (Lu et al., 1996). ATML1 was proposed to be involved in setting 
up morphogenetic boundaries of positional information necessary for controlling cell 

15 specification and pattern formation based on gene expression patterns. GL2 that has 
been studied for its role in trichome, root-hair and seed coat development (Rerie et al., 
1994, Cristina et al., 1996, Masucci et al., 1996). The GL2 mutations resulted in 
aborted trichomes with aberrant cell expansion whereas entopic expression noticeably 
increased the number of trichomes and induced clusters of trichome formation (Ohashi 

20 et al., 2002). The GhHDl gene is expressed in ovules with higher expression in outer 
integument, in fibres as well as in leaves and this expression pattern probably reflects a 
more general role in different epidermal cell specification and pattern formation similar 
to that shown by the GL2. 

DNA endoreduplication, a strategy to amplify nuclear DNA without cell 

25 division is a major mechanism leading to somatic polyploidisation in plants (reviewed 
by Joubes and Chevalier 2000). Correlations have been established between 
polyploidy and cell differentiation and cell expansion. While it is well established that 
Arabidopsis trichomes undergo four rounds of endoreduplication during development, 
leading to branched cells with nuclei containing about 32C DNA (Schnittger et al., 

30 2002), it has been less than clear whether cotton fibre initials undergo a similar process. 
Berlin (1986) studied tritiated thymidine uptake by epidermal layer using in vitro 
cultured cotton ovules and observed that there was an increase in thymidine 
incorporation from -2 dpa to 1 dpa and then the incorporation declined and finally 
stopped at 6 dpa. These observations were interpreted as DNA synthesis in preparation 

35 for cell division. Since fibre initials do not undergo divisions and no thymidine 
incorporation was observed in the elongating fibres, the author suggested that gene 
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amplification did not occur during fibre development over the time observed (Berlin 
1986). Van't Hof (1998) reported that the DNA content of developing cotton fibre 
cells only increased by about 24% from 2 dpa to 5 dpa and suggested that during early 
stages of development fibre cell nuclei either selectively amplify certain sequences or 
5 enter S-phase replicating only a portion of their genome. Using laser-confocal 
microscopy and propidium iodide staining, we examined ovule epidermal cell division 
rates and DNA contents of epidermal cells and fibre cells. Our results show that the 
epidermal cell division rates remain relatively constant from -2 dpa to 0 dpa (with a 
small increase at -1 dpa). While nuclear DNA contents of epidermal cells remain 

10 largely unchanged from -2 dpa to 0 dpa, the . fibre initials contain nuclei that mostly 
show higher than 2C DNA content with the majority of cells showing DNA contents 
between 2.8 C and 5.2C. While our results suggest the fibre initials undergo one round 
DNA endoreduplication starting at 0 dpa, they do not exclude the possibility of 
selective amplification of certain sequences or partial replication of the genome at later 

15 stages suggested by Van't Hof s work. In addition, the enlarged nuclei in fibre initials 
revealed by the ultrastructural studies (review by Berlin 1986) provided further support 
to the DNA amplification phenomenon. 

Although accumulating data reveal that DNA endoreduplication is 
developmental^ regulated, it is still poorly understood in plants (reviewed by Joubds 

20 and Chevalier 2000). Assuming the endoreduplication is a modified cell cycle, it may 
share common determinants with the classic cell cycle (Joubes and Chevalier 2000). 
The two main control points in the cell cycle are at the Gl/S and G2/M transitions and 
in most plant cell types, the primary control point probably operates during Gl phase. 
This period not only includes the point of commitment to cell division, but may also 

25 represent the time during which differentiation decisions are made (reviewed by Meijer 
and Murray 2000). Mammalian cyclin D-Cdk4 complexes have been characterized as 
growth factor-responsive cell cycle regulators operating during Gl phase. Cyclin D3 
was found to be present at high levels in megakaryocytes undergoing endoreduplication 
and was upregulated following exposure to the proliferation, maturation and ploidy- 

30 promoting factor, Mpl ligand (Zimmet et al, 1997). In plants, the presence of mutiple 
Cyclin D3 genes raises the question of functional redundancy of these genes and the 
extent to which they may have distinct or overlapping roles (Meijer and Murray, 2000). 
Arabidopsis CycD3;l, which is highly cytokinin-inducible (Riou-Khamlichi et al. 
1999), when ectopically expressed, induced not only DNA replication but also cell 

35 division in trichomes (Schnittger et al., 2002). In synchronized tobacco BY-2 cell 
suspension cultures, tobacco CycD3; 2 was induced in Gl and remained at a constant 
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level through out the cell cycle, similar to mammalian D-type cyclins. In contrast, 
CycD3;l transcripts accumulated during mitosis, a pattern of expression not normally 
associated with D-type cyclins, suggesting a novel role for plant cyclins during mitosis 
or alternatively a BY-2 cell-specific phenomenon and not a normal feature of plant cell- 
5 cycle progression (Sorrell et al., 1999). The GhCycD3.1 identified in our experiments 
shows highest sequence homology to Euphobia esula cyclin D3;2. Since the decreased 
expression of this gene in the outer integuments of lintless mutant 4A-183 did not 
affect the epidermal cell division rates, it is appears that this gene is involved in the 
DNA endoreduplication of fibre initials similar to the tomato CycD3;Ts involvement 
10 in endoreduplication of the differentiated giant cells of the fruit gel tissue (Joubds et al., 
2000). 

It will be appreciated by persons skilled in the art that numerous variations 
and/or modifications may be made to the invention as shown in the specific 
embodiments without departing from the spirit or scope of the invention as broadly 
15 described. The present embodiments are, therefore, to be considered in all respects as 
illustrative and not restrictive. 

All publications discussed above are incorporated herein in their entirety. 

Any discussion of documents, acts, materials, devices, articles or the like which 
has been included in the present specification is solely for the purpose of providing a 
20 context for the present invention. It is not to be taken as an admission that any or all of 
these matters form part of the prior art base or were common general knowledge in the 
field relevant to the present invention as it existed before the priority date of each claim 
of this application. 



Dated this thirty-first day of March 2004 



Commonwealth Scientific and Industrial 
Research Organisation 
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CLAIMS 

1 . A method of altering fibre initiation and/or elongation in a fibre producing plant 
comprising manipulating said plant such that the production of a polypeptide is 

5 modified when compared to a wild-type plant, wherein the polypeptide comprises a 
sequence selected from the group consisting of: 

i) an amino acid sequence provided as any one of SEQ ID NO T s:l to 16; or 

ii) an amino acid sequence which is at least 50% identical to any one of SEQ ID 
NO's:l to 16. 

10 

2. The method of claim 1, wherein the polypeptide comprises a sequence selected 
from the group consisting of: 

i) an amino acid sequence provided as SEQ ID NO: 1 ; or 

ii) an amino acid sequence which is at least 80% identical to SEQ ID NO:l . 

15 

3. The method of claim 1, wherein the polypeptide comprises a sequence selected 
from the group consisting of: 

i) an amino acid sequence provided as SEQ ID NO:2; or 

ii) an amino acid sequence which is at least 80% identical to SEQ ID NO:2. 

20 . 

4. The method of claim 1, wherein the polypeptide comprises a sequence selected 
from the group consisting of: 

i) an amino acid sequence provided as SEQ ID NO:3; or 

ii) an amino acid sequence which is at least 80% identical to SEQ ID NO:3. 

25 

5. The method according to any one of claims 1 to 4 which comprises 
recombinantly expressing the polypeptide in said plant. 

6. The method according to any one of claims 1 to 4 which comprises reducing the 
30 level of the polypeptide endogenously produced by the plant. 

7. The method of claim 6, wherein the level of the polypeptide endogenously 
produced by the plant is reduced by exposing the plant to an antisense polynucleotide 
or a catalytic polynucleotide which hybridizes to an mRNA molecule encoding the 

35 polypeptide. 
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8. The method of claim 6, wherein the level of the polypeptide endogenously 
produced by the plant is reduced by exposing the plant to a dsRNA molecule that 
specifically down-regulates mRNA levels in a cell of an mRNA molecule encoding the 
polypeptide. 

5 

9. The method according to any one of claims 1 to 8, wherein the plant is a 
horticultural plant. 

10. The method according to any one of claims 1 to 8, wherein the plant is a species 
10 of the Genus Gossypium. 

11. A method of assessing the potential of a fibre producing plant to produce fibre, 
the method comprising analysing the plant for a genetic variation in a polynucleotide 
associated with fibre initiation and/or elongation, wherein the polynucleotide comprises 

15 a sequence selected from the group consisting of: 

i) a nucleotide sequence provided as any one of SEQ ID NO f s: 1 7 to 45; or 

ii) a nucleotide sequence which is at least 50% identical to any one of SEQ ID 
NO , s:17to45. 

20 12. The method of claim 11 which comprises performing an amplification reaction 
on nucleic acids obtained from said plant, or nucleic acids synthesized using nucleic 
acids from said plant as a template, wherein the production of an amplicon in said 
amplification reaction indicates an association with fibre producing potential. 

25 13. The method of claim 1 1 which comprises performing an amplification reaction 
on nucleic acids obtained from said plant, or nucleic acids synthesized using nucleic 
acids from said plant as a template, wherein the lack of production of an amplicon in 
said amplification reaction indicates an association with fibre producing potential. 

30 14. The method of claim 1 1 which comprises performing a hybridization reaction on 
nucleic acids obtained from said plant, or nucleic acids synthesized using nucleic acids 
from said plant as a template, wherein a detectable signal produced by the hybridization 
reaction indicates reduced fibre producing potential. 

35 15. The method of claim 11 which comprises performing a hybridization reaction on 
nucleic acids obtained from said plant, or nucleic acids synthesized using nucleic acids 
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from said plant as a template, wherein the lack of a detectable signal by the 
hybridization reaction indicates reduced fibre producing potential. 

16. The method of claim 1 1, wherein the polynucleotide is mRNA and the method 
comprises determining the levels of mRNA of the polynucleotide in the plant ovule at, 
or around, anthesis. 

17. A method of assessing the potential of a fibre producing plant to produce fibre, 
the method comprising analysing the plant for polypeptide involved in fibre initiation 
and/or elongation, wherein the polypeptide comprises a sequence selected from the 
group consisting of: 

i) an amino acid sequence provided as any one of SEQ ID NO ! s:l to 16; or 

ii) an amino acid sequence which is at least 50% identical to any one of SEQ ID 
NO's:ltol6. 

18. The method of claim 17, wherein the method comprises determining the levels 
of the polypeptide in the plant ovule at, or around, anthesis. 

19. A substantially purified polypeptide selected from the group consisting of: 

i) a polypeptide comprising an amino acid sequence as provided in SEQ ID 

NO:l, 

ii) . a polypeptide comprising an amino acid sequence which is at least 87% 
identical to SEQ ID NO: 1 , and 

iii) a biologically active fragment of i) or ii), 

wherein the polypeptide regulates fibre initiation and/or elongation. 

20. The polypeptide of claim 19, wherein the polypeptide comprises an amino acid 
sequence which is at least 95% identical to SEQ ID NO:l. 

21. A substantially purified polypeptide selected from the group consisting of: 

i) a polypeptide comprising an amino acid sequence as provided in SEQ ID 
NO:2, and 

ii) a biologically active fragment of i), 

wherein the polypeptide regulates fibre initiation and/or elongation. 
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22. A substantially purified polypeptide selected from the group consisting of: 

i) a polypeptide comprising an amino acid sequence as provided in SEQ ID 

NO:3, 

ii) a polypeptide comprising an amino acid sequence which is at least 54% 
5 identical to SEQ ID NO:3, and 

iii) a biologically active fragment of i) or ii), 

wherein the polypeptide regulates fibre initiation and/or elongation. 

23 . A substantially purified polypeptide selected from the group consisting of: 

10 i) a polypeptide comprising an amino acid sequence as provided in SEQ ID 

NO:4, 

ii) a polypeptide comprising an amino acid sequence which is at least 55% 
identical to SEQ ID NO:4, and 

iii) a biologically active fragment of i) or ii), 

1 5 wherein the polypeptide regulates fibre initiation and/or elongation, 

24. A substantially purified polypeptide selected from the group consisting of: 

i) a polypeptide comprising an amino acid sequence as provided in SEQ ID 

NO:5, 

20 ii) a polypeptide comprising an amino acid sequence which is at least 50% 

identical to SEQ ID NO:5, and 

iii) a biologically active fragment of i) or ii), 
wherein the polypeptide regulates fibre initiation and/or elongation. 

25 25. A substantially purified polypeptide selected from the group consisting of: 

i) a polypeptide comprising an amino acid sequence as provided in SEQ ID 

NO:6, 

ii) a polypeptide comprising an amino acid sequence which is at least 50% 
identical to SEQ ID NO:6, and 

30 iii) a biologically active fragment of i) or ii), 

wherein the polypeptide regulates fibre initiation and/or elongation. 

26. A substantially purified polypeptide selected from the group consisting of: 

i) a polypeptide comprising an amino acid sequence as provided in SEQ ID 

35 NO:7, 
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ii) a polypeptide comprising an amino acid sequence which is at least 79% 
identical to SEQ ID NO:7, and 

iii) a biologically active fragment of i) or ii), 

wherein the polypeptide regulates fibre initiation and/or elongation. 

27. A substantially purified polypeptide selected from the group consisting of: 

i) a polypeptide comprising an amino acid sequence as provided in SEQ ID 

NO:8, 

ii) a polypeptide comprising an amino acid sequence which is at least 66% 
identical to SEQ ID NO:8, and 

iii) a biologically active fragment of i) or ii), 

wherein the polypeptide regulates fibre initiation and/or elongation. 

28. A substantially purified polypeptide selected from the group consisting of: 

i) a polypeptide comprising an amino acid sequence as provided in SEQ ID 

NO:9, 

ii) a polypeptide comprising an amino acid sequence which is at least 95% 
identical to SEQ ID NO:9, and 

iii) a biologically active fragment of i) or ii), 

wherein the polypeptide regulates fibre initiation and/or elongation. 

29. A substantially purified polypeptide selected from the group consisting of: 

i) a polypeptide comprising an amino acid sequence as provided in SEQ ID 
NO: 10, 

ii) a polypeptide comprising an amino acid sequence which is at least 67% 
identical to SEQ ID NO: 1 0, and 

iii) a biologically active fragment of i) or ii), 

wherein the polypeptide regulates fibre initiation and/or elongation. 

30. A substantially purified polypeptide selected from the group consisting of: 

i) a polypeptide comprising an amino acid sequence as provided in SEQ ID 
NO:ll, 

ii) a polypeptide comprising an amino acid sequence which is at least 55% 
identical to SEQ ID NO: 1 1, and 

iii) a biologically active fragment of i) or ii), 

wherein the polypeptide regulates fibre initiation and/or elongation. 
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31. A substantially purified polypeptide selected from the group consisting of: 

i) a polypeptide comprising an amino acid sequence as provided in SEQ ID 
NO: 12, 

ii) a polypeptide comprising an amino acid sequence which is at least 59% 
5 identical to SEQ ID NO: 1 2, and 

iii) a biologically active fragment of i) or ii), 

wherein the polypeptide regulates fibre initiation and/or elongation. 

32. A substantially purified polypeptide selected from the group consisting of: 

10 i) a polypeptide comprising an amino acid sequence as provided in SEQ ID 

NO: 13, 

ii) a polypeptide comprising an amino acid sequence which is at least 77% 
identical to SEQ ID NO: 13, and 

iii) a biologically active fragment of i) or ii), 

1 5 wherein the polypeptide regulates fibre initiation and/or elongation. 

33. A substantially purified polypeptide selected from the group consisting of: 

i) a polypeptide comprising an amino acid sequence as provided in SEQ ID 
NO: 14, 

20 ii) a polypeptide comprising an amino acid sequence which is at least 50% 

identical to SEQ ID NO: 14, and 

iii) a biologically active fragment of i) or ii), 
wherein the polypeptide regulates fibre initiation and/or elongation. 

25 34. A substantially purified polypeptide selected from the group consisting of: 

i) a polypeptide comprising an amino acid sequence as provided in SEQ ID 
NO:15, 

ii) a polypeptide comprising an amino acid sequence which is at least 64% 
identical to SEQ ID NO: 15, and 

30 iii) a biologically active fragment of i) or ii), 

wherein the polypeptide regulates fibre initiation and/or elongation. 

35. A substantially purified polypeptide selected from the group consisting of: 

i) a polypeptide comprising an amino acid sequence as provided in SEQ ID 
35 NO: 16, 
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ii) a polypeptide comprising an amino acid sequence which is at least 50% 
identical to SEQ ID NO: 1 6, and 

iii) a biologically active fragment of i) or ii), 

wherein the polypeptide regulates fibre initiation and/or elongation. 
5 . 

36. The polypeptide according to any one of claims 19 to 35, wherein the 
polypeptide can be purified from a. species of the Genus Gossypium. 

37. The polypeptide according to any one of claims 19 to 36, which is a fusion 
10 protein further comprising at least one other polypeptide sequence. 

38. An isolated polynucleotide comprising a sequence of nucleotides selected from 
the group consisting of: 

i) a sequence of nucleotides as provided in SEQ ID NO: 1 7; 
15 ii) a sequence of nucleotides as provided in SEQ ID NO: 18; 

iii) a sequence encoding a polypeptide according to claim 19 or claim 20; 

iv) a sequence of nucleotides which is at least 87% identical to SEQ ID NO: 17 
or SEQ ID NO: 18; and 

v) a sequence which hybridizes to any one of i) to iv) under high stringency 
20 conditions, 

wherein the polynucleotide does not comprise a sequence of nucleotides as 
provided in SEQ ID NO:46. 

39. An isolated polynucleotide comprising a sequence of nucleotides selected from 
25 the group consisting of: 

i) a sequence of nucleotides as provided in SEQ ID NO: 1 9, 

ii) a sequence of nucleotides as provided in SEQ ID NO:20, 

iii) a sequence encoding a polypeptide according to claim 21, and 

iv) a sequence complementary to any one of i) to iii). 

30 

40. An isolated polynucleotide comprising a sequence of nucleotides selected from 
the group consisting of: 

i) a sequence of nucleotides as provided in SEQ ID NO:21 ; 

ii) a sequence of nucleotides as provided in SEQ ID NO:22; 
35 iii) a sequence encoding a polypeptide according to claim 22; 
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iv) a sequence of nucleotides which is at least 54% identical to SEQ ID NO:21 
or SEQ ID NO:22;and 

v) a sequence which hybridizes to any one of i) to iv) under high stringency 
conditions, 

5 wherein the polynucleotide does not comprise a sequence of nucleotides as 

provided hi SEQ ID NO:47. 

41. An isolated polynucleotide comprising a sequence of nucleotides selected from 
the group consisting of: 

10 i) a sequence of nucleotides as provided in SEQ ID NO:23; 

ii) a sequence encoding a polypeptide according to claim 23; 

iii) a sequence of nucleotides which is at least 55% identical to SEQ ID NO:23; 

and 

iv) a sequence which hybridizes to any one of i) to iii) under high stringency 
15 conditions, 

wherein the polynucleotide does not comprise a sequence of nucleotides as 
provided in SEQ ID NO:48. 

42. An isolated polynucleotide comprising a sequence of nucleotides selected from 
20 the group consisting of: 

i) a sequence of nucleotides as provided in SEQ ID NO:24; 

ii) a sequence of nucleotides as provided in SEQ ID NO:25; 

iii) a sequence encoding a polypeptide according to claim 24; 

iv) a sequence of nucleotides which is at least 50% identical to SEQ ID NO:24 
25 orSEQIDNO:2S;and 

v) a sequence which hybridizes to any one of i) to iv) under high stringency 
conditions, 

wherein the polynucleotide does not comprise a sequence of nucleotides as 
provided in SEQ ID NO:49. 

30 

43. An isolated polynucleotide comprising a sequence of nucleotides selected from 
the group consisting of: 

i) a sequence of nucleotides as provided in SEQ ID NO:26; 

ii) a sequence of nucleotides as provided in SEQ ID NO:27; 
35 iii) a sequence encoding a polypeptide according to claim 25; 
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iv) a sequence of nucleotides which is at least 50% identical to SEQ ID NO:26 
or SEQ ID NO:27; and 

v) a sequence which hybridizes to any one of i) to iv) under high stringency 
conditions, 

5 wherein the polynucleotide does not comprise a sequence of nucleotides as 

provided in SEQ ID NO:50. 

44. An isolated polynucleotide comprising a sequence of nucleotides selected from 
the group consisting of: 
10 i) a sequence of nucleotides as provided in SEQ ID NO:28, 

ii) a sequence of nucleotides as provided in SEQ ID NO:29, 

iii) a sequence encoding a polypeptide according to claim 26, 

iv) a sequence complementary to any one of i) to iii). 

15 45. An isolated polynucleotide comprising a sequence of nucleotides selected from 
the group consisting of: 

i) a sequence of nucleotides as provided in SEQ ID NO:30; 

ii) a sequence of nucleotides as provided in SEQ ID NO: 3 1 ; 

iii) a sequence encoding a polypeptide according to claim 27; 

20 iv) a sequence of nucleotides which is at least 65% identical to SEQ ID NO: 30 

orSEQIDNO:31;and 

v) a sequence which hybridizes to any one of i) to iv) under high stringency 
conditions, 

wherein the polynucleotide does not comprise a sequence of nucleotides as 
25 provided in SEQ ID NO:51. 

46. An isolated polynucleotide comprising a sequence of nucleotides selected from 
the group consisting of: 

i) a sequence of nucleotides as provided in SEQ ID NO: 32, 
30 ii) a sequence of nucleotides as provided in SEQ ID NO:33, 

iii) a sequence encoding a polypeptide according to claim 28, 

iv) a sequence complementary to any one of i) to iii), 

47. An isolated polynucleotide comprising a sequence of nucleotides selected from 
35 the group consisting of: 

i) a sequence of nucleotides as provided in SEQ ID NO:34; 
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ii) a sequence of nucleotides as provided in SEQ ID NO:35; 

iii) a sequence encoding a polypeptide according to claim 29; 

iv) a sequence of nucleotides which is at least 70% identical to SEQ ID NO:34 
orSEQIDNO:35;and 

5 v) a sequence which hybridizes to any one of i) to iv) tinder high stringency 

conditions, 

wherein the polynucleotide does not comprise a sequence of nucleotides as 
provided in SEQ ID NO:52. 

10 48. An isolated polynucleotide comprising a sequence of nucleotides selected from 
the group consisting of: 

i) a sequence of nucleotides as provided in SEQ ID NO:36; 

ii) a sequence of nucleotides as provided in SEQ ID NO:37; 

iii) a sequence encoding a polypeptide according to claim 30; 

15 iv) a sequence of nucleotides which is at least 55% identical to SEQ ID NO:36 

or SEQ ID NO:37; and 

v) a sequence which hybridizes to any one of i) to iv) under high stringency 
conditions, 

wherein the polynucleotide does not comprise a sequence of nucleotides as 
20 provided in SEQ ID NO:53. 

49. An isolated polynucleotide comprising a sequence of nucleotides selected from 
the group consisting of: 

i) a sequence of nucleotides as provided in SEQ ID NO:38; 
25 ii) a sequence encoding a polypeptide according to claim 3 1 ; 

iii) a sequence of nucleotides which is at least 65% identical to SEQ ID NO:38; 

and 

iv) a sequence which hybridizes to any one of i) to iii) under high stringency 
conditions. 

30 

50. An isolated polynucleotide comprising a sequence of nucleotides selected from 
the group consisting of: 

i) a sequence of nucleotides as provided in SEQ ID NO:39; 

ii) a sequence of nucleotides as provided in SEQ ID NO:40; 
35 iii) a sequence encoding a polypeptide according to claim 32; 



74 



iv) a sequence of nucleotides which is at least 95% identical to SEQ ID NO:39 
or SEQ ID NO:40; and 

v) a sequence which hybridizes to any one of i) to iv) under high stringency 
conditions, 

5 wherein the polynucleotide does not comprise a sequence of nucleotides as 

provided in SEQ ID NO: 54. 

51. An isolated polynucleotide comprising a sequence of nucleotides selected from 
the group consisting of: 
10 i) a sequence of nucleotides as provided in SEQ ID NO:41 ; 

ii) a sequence of nucleotides as provided in SEQ ID NO:42; 

iii) a sequence encoding a polypeptide according to claim 33; 

iv) a sequence of nucleotides which is at least 50% identical to SEQ ID NO:41 
or SEQ ID NO:42; and 

15 v) a sequence which hybridizes to any one of i) to iv) under high stringency 

conditions, 

wherein the polynucleotide does not comprise a sequence of nucleotides as 
provided in SEQ ID NO:55. 

20 52. An isolated polynucleotide comprising a sequence of nucleotides selected from 
the group consisting of: 

i) a sequence of nucleotides as provided in SEQ ID NO:43; 

ii) a sequence of nucleotides as provided in SEQ ID NO:44; 

iii) a sequence encoding a polypeptide according to claim 34; 

25 iv) a sequence of nucleotides which is at least 65% identical to SEQ ID NO:43 

or SEQ ID NO:44; and 

v) a sequence which hybridizes to any one of i) to iv) under high stringency 
conditions, 

wherein the polynucleotide does not comprise a sequence of nucleotides as 
30 provided in SEQ ID NO:56. 

53. An isolated polynucleotide comprising a sequence of nucleotides selected from 
the group consisting of: 

i) a sequence of nucleotides as provided in SEQ ID NO:45; 
35 ii) a sequence encoding a polypeptide according to claim 35; 
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iii) a sequence of nucleotides which is at least 50% identical to SEQ ID NO:45; 

and 

iv) a sequence which hybridizes to any one of i) to iii) under high stringency 
conditions, 

5 wherein the polynucleotide does not comprise a sequence of nucleotides as 

provided in SEQ ID NO:57. 

54. A catalytic polynucleotide capable of cleaving a polynucleotide according to 
any one of claims 38 to 53. 

10 

55. The catalytic polynucleotide of claim 54 which is a ribozyme. 

56. An oligonucleotide which comprises at least 19 contiguous nucleotides of a 
polynucleotide according to any one of claims 38 to 53. 

15 

57. A double stranded RNA (dsRNA) molecule comprising an oligonucleotide 
according to claim 56, wherein the portion of the molecule that is double stranded is at 
least 19 basepairs in length and comprises said oligonucleotide. 

20 58. The dsRNA molecule of claim 57 which is expressed from a single promoter, 
wherein the strands of the double stranded portion are linked by a single stranded 
portion. 

59. A vector comprising or encoding the polynucleotide according to any one of 
25 claims 38 to 53. 

60. The vector of claim 59 wherein the polynucleotide is operably linked to an ovule 
or fibre specific promoter. 

30 61. A vector comprising or . encoding oligonucleotide of claim 56 or the dsRNA 
molecule of claim 57. 

62. A host cell comprising the vector according to any one of claims 59 to 61 . 



63. A transgenic plant, the plant having been transformed with polynucleotide 
according to any one of claims 38 to 53 or the oligonucleotide of claim 56. 
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64. The transgenic plant of claim ,63 wherein the polynucleotide is capable of 
expression to produce a polypeptide according to any one of claims 19 to 37. 

65. A transgenic plant, the plant having been transformed such that it produces a 
5 catalytic polynucleotide of claim 54 or claim 55, or a dsRNA molecule of claim 57 or 

claim 58. 

66. The transgenic plant of claim 63 or claim 65, wherein the polynucleotide, 
catalytic polynucleotide or dsRNA down-regulates the production of a polypeptide 

1 0 according to any one of claims 1 9 to 37 which is endogenously produced by the plant. 

67. A substantially purified antibody, or fragment thereof, that specifically binds a 
polypeptide according to any one of claims 19 to 37. 

15 68. A method of breeding a fibre producing plant, the method comprising 
performing a method according to any one of claims 1 1 to 18. 

M 69. A method of selecting from a breeding population a fibre producing plant with 
altered fibre initiation and/or elongation potential, the method comprising; 
20 i) crossing two plants which have differing potential to produce fibre, 

ii) performing a method according to any one of claims 11 to 18 on progeny 

plants, 

iii) selecting a progeny plant with altered fibre initiation and/or elongation 
•potential when compared to a parent plant. . 

25 

70. A plant produced by the method of claim 68 or claim 69. 

71 . Seed of a plant of any one of claims 63 to 66 or claim 70. 

30 72. Fibre of a plant of any one of claims 63 to 66 or claim 70. 

73. A method of identifying an agent which alters fibre initiation and/or elongation 
of a fibre producing plant, the method comprising 

a) exposing a polypeptide which is at least 50% identical to any one of SEQ ID 
35 NO's:l to 16 to a candidate agent, and 
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b) assessing the ability of the candidate agent to modulate the activity of the 
polypeptide. 

74. A method of identifying an agent which alters fibre initiation and/or elongation 
5 of a fibre producing plant, the method comprising 

a) exposing a polypeptide which is at least 50% identical to any one of SEQ ID 
NO's:l to 16 to a binding partner which binds the polypeptide, and a candidate agent, 
and 

b) assessing the ability of the candidate agent to compete with the binding 
1 0 partner for binding to the polypeptide. 

75. The method of claim 74, wherein the binding partner is detectably labeled. 

76. A method of identifying an agent which alters fibre initiation and/or elongation 
15 of a fibre producing plant, the method comprising 

a) exposing a polynucleotide encoding a polypeptide which is at least 50% 
identical to any one of SEQ ID NO's:l to 16 to a candidate agent'under conditions 
which allow expression of the polynucleotide, and 

b) assessing the ability of the candidate agent to modulate levels of polypeptide 
20 produced by the polynucleotide. 

77. A method of identifying an agent which alters fibre initiation and/or elongation 
of a fibre producing plant, the method comprising 

a) exposing a polynucleotide which is at least 50% identical to any one of SEQ 
25 ID NO's: 17 to 45 to a candidate agent, and 

b) assessing the ability of the candidate agent to hybridize and/or cleave the 
polynucleotide. 
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